Overview

Dataset statistics

Number of variables15
Number of observations956
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory112.2 KiB
Average record size in memory120.1 B

Variable types

Numeric3
Text8
Categorical4

Alerts

Unnamed: 0 is uniformly distributedUniform
Unnamed: 0 has unique valuesUnique

Reproduction

Analysis started2024-07-11 06:26:03.812484
Analysis finished2024-07-11 06:26:05.193797
Duration1.38 second
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

Unnamed: 0
Real number (ℝ)

UNIFORM  UNIQUE 

Distinct956
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean477.5
Minimum0
Maximum955
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:05.266998image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile47.75
Q1238.75
median477.5
Q3716.25
95-th percentile907.25
Maximum955
Range955
Interquartile range (IQR)477.5

Descriptive statistics

Standard deviation276.11773
Coefficient of variation (CV)0.57825702
Kurtosis-1.2
Mean477.5
Median Absolute Deviation (MAD)239
Skewness0
Sum456490
Variance76241
MonotonicityStrictly increasing
2024-07-11T11:56:05.848390image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1
 
0.1%
642 1
 
0.1%
630 1
 
0.1%
631 1
 
0.1%
632 1
 
0.1%
633 1
 
0.1%
634 1
 
0.1%
635 1
 
0.1%
636 1
 
0.1%
637 1
 
0.1%
Other values (946) 946
99.0%
ValueCountFrequency (%)
0 1
0.1%
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
ValueCountFrequency (%)
955 1
0.1%
954 1
0.1%
953 1
0.1%
952 1
0.1%
951 1
0.1%
950 1
0.1%
949 1
0.1%
948 1
0.1%
947 1
0.1%
946 1
0.1%
Distinct328
Distinct (%)34.3%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:06.114366image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length131
Median length70
Mean length27.992678
Min length9

Characters and Unicode

Total characters26761
Distinct characters72
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)14.2%

Sample

1st rowData Scientist
2nd rowHealthcare Data Scientist
3rd rowData Scientist
4th rowData Scientist
5th rowData Scientist
ValueCountFrequency (%)
data 719
19.6%
scientist 540
 
14.7%
216
 
5.9%
engineer 201
 
5.5%
senior 152
 
4.1%
analyst 124
 
3.4%
sr 56
 
1.5%
analytics 49
 
1.3%
science 46
 
1.3%
manager 38
 
1.0%
Other values (398) 1532
41.7%
2024-07-11T11:56:06.486001image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2717
 
10.2%
t 2697
 
10.1%
a 2553
 
9.5%
i 2393
 
8.9%
e 2287
 
8.5%
n 2104
 
7.9%
c 1216
 
4.5%
s 1208
 
4.5%
r 1073
 
4.0%
S 999
 
3.7%
Other values (62) 7514
28.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 26761
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2717
 
10.2%
t 2697
 
10.1%
a 2553
 
9.5%
i 2393
 
8.9%
e 2287
 
8.5%
n 2104
 
7.9%
c 1216
 
4.5%
s 1208
 
4.5%
r 1073
 
4.0%
S 999
 
3.7%
Other values (62) 7514
28.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 26761
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2717
 
10.2%
t 2697
 
10.1%
a 2553
 
9.5%
i 2393
 
8.9%
e 2287
 
8.5%
n 2104
 
7.9%
c 1216
 
4.5%
s 1208
 
4.5%
r 1073
 
4.0%
S 999
 
3.7%
Other values (62) 7514
28.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 26761
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2717
 
10.2%
t 2697
 
10.1%
a 2553
 
9.5%
i 2393
 
8.9%
e 2287
 
8.5%
n 2104
 
7.9%
c 1216
 
4.5%
s 1208
 
4.5%
r 1073
 
4.0%
S 999
 
3.7%
Other values (62) 7514
28.1%
Distinct417
Distinct (%)43.6%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:06.735818image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length41
Median length36
Mean length21.610879
Min length2

Characters and Unicode

Total characters20660
Distinct characters37
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)20.5%

Sample

1st row$53K-$91K (Glassdoor est.)
2nd row$63K-$112K (Glassdoor est.)
3rd row$80K-$90K (Glassdoor est.)
4th row$56K-$97K (Glassdoor est.)
5th row$86K-$143K (Glassdoor est.)
ValueCountFrequency (%)
est 725
29.5%
glassdoor 692
28.2%
1 214
 
8.7%
per 24
 
1.0%
hour(glassdoor 21
 
0.9%
provided 17
 
0.7%
employer 17
 
0.7%
54k-$115k 6
 
0.2%
49k-$113k 6
 
0.2%
21-$34 6
 
0.2%
Other values (414) 727
29.6%
2024-07-11T11:56:07.087686image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 2151
 
10.4%
1499
 
7.3%
o 1496
 
7.2%
$ 1484
 
7.2%
K 1436
 
7.0%
1 1133
 
5.5%
- 956
 
4.6%
r 824
 
4.0%
e 795
 
3.8%
l 759
 
3.7%
Other values (27) 8127
39.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 20660
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
s 2151
 
10.4%
1499
 
7.3%
o 1496
 
7.2%
$ 1484
 
7.2%
K 1436
 
7.0%
1 1133
 
5.5%
- 956
 
4.6%
r 824
 
4.0%
e 795
 
3.8%
l 759
 
3.7%
Other values (27) 8127
39.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 20660
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
s 2151
 
10.4%
1499
 
7.3%
o 1496
 
7.2%
$ 1484
 
7.2%
K 1436
 
7.0%
1 1133
 
5.5%
- 956
 
4.6%
r 824
 
4.0%
e 795
 
3.8%
l 759
 
3.7%
Other values (27) 8127
39.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 20660
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
s 2151
 
10.4%
1499
 
7.3%
o 1496
 
7.2%
$ 1484
 
7.2%
K 1436
 
7.0%
1 1133
 
5.5%
- 956
 
4.6%
r 824
 
4.0%
e 795
 
3.8%
l 759
 
3.7%
Other values (27) 8127
39.3%
Distinct596
Distinct (%)62.3%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:07.333232image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10051
Median length4223
Mean length3762.4215
Min length407

Characters and Unicode

Total characters3596875
Distinct characters123
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique314 ?
Unique (%)32.8%

Sample

1st rowData Scientist Location: Albuquerque, NM Education Required: Bachelor’s degree required, preferably in math, engineering, business, or the sciences. Skills Required: Bachelor’s Degree in relevant field, e.g., math, data analysis, database, computer science, Artificial Intelligence (AI); three years’ experience credit for Master’s degree; five years’ experience credit for a Ph.D Applicant should be proficient in the use of Power BI, Tableau, Python, MATLAB, Microsoft Word, PowerPoint, Excel, and working knowledge of MS Access, LMS, SAS, data visualization tools, and have a strong algorithmic aptitude Excellent verbal and written communication skills, and quantitative analytical skills are required Applicant must be able to work in a team environment U.S. citizenship and ability to obtain a DoD Secret Clearance required Responsibilities: The applicant will be responsible for formulating analytical solutions to complex data problems; creating data analytic models to improve data metrics; analyzing customer behavior and trends; delivering insights to stakeholders, as well as designing and crafting reports, dashboards, models, and algorithms to make data insights actionable; selecting features, building and optimizing classifiers using machine learning techniques; data mining using state-of-the-art methods, extending organization’s data with third party sources of information when needed; enhancing data collection procedures to include information that is relevant for building analytic systems; processing, cleansing, and verifying the integrity of data used for analysis; doing ad-hoc analysis and presenting results in a clear manner; and creating automated anomaly detection systems and constant tracking of its performance. Benefits: We offer competitive salaries commensurate with education and experience. We have an excellent benefits package that includes: Comprehensive health, dental, life, long and short term disability insurance 100% Company funded Retirement Plans Generous vacation, holiday and sick pay plans Tuition assistance Benefits are provided to employees regularly working a minimum of 30 hours per week. Tecolote Research is a private, employee-owned corporation where people are our primary resource. Our investments in technology and training give our employees the tools to ensure our clients are provided the solutions they need, and our very high employee retention rate and stable workforce is an added value to our customers. Apply now to connect with a company that invests in you.
2nd rowWhat You Will Do: I. General Summary The Healthcare Data Scientist position will join our Advanced Analytics group at the University of Maryland Medical System (UMMS) in support of its strategic priority to become a data-driven and outcomes-oriented organization. The successful candidate will have 3+ years of experience with Machine Learning, Predictive Modeling, Statistical Analysis, Mathematical Optimization, Algorithm Development and a passion for working with healthcare data. Previous experience with various computational approaches along with an ability to demonstrate a portfolio of relevant prior projects is essential. This position will report to the UMMS Vice President for Enterprise Data and Analytics (ED&A). II. Principal Responsibilities and Tasks • Develops predictive and prescriptive analytic models in support of the organization’s clinical, operations and business initiatives and priorities. • Deploys solutions so that they provide actionable insights to the organization and are embedded or integrated with application systems • Supports and drives analytic efforts designed around organization’s strategic priorities and clinical/business problems • Works in a team to drive disruptive innovation, which may translate into improved quality of care, clinical outcomes, reduced costs, temporal efficiencies and process improvements. • Builds and extends our analytics portfolio supported by robust documentation • Works with autonomy to find solutions to complex problems using open source tools and in-house development • Stays abreast of state-of-the-art literature in the fields of operations research, statistical modeling, statistical process control and mathematical optimization • Creates, communicates, and manages the project plans and other required project documentation and provides updates to leadership as necessary • Develops and maintains relationships with business, IT and clinical leaders and stakeholders across the enterprise to facilitate collaboration and effective communication • Works with the analytics team and clinical/business stakeholders to develop pilots so that they may be tested and validated in pilot settings • Performs analysis to evaluate primary and secondary objectives from such pilots • Assists leadership with strategies for scaling successful projects across the organization and enhances the analytics applications based on feedback from end-users and clinical/business consumers • Assists leadership with dissemination of success stories (and failures) in an effort to increase analytics literacy and adoption across the organization. What You Need to Be Successful: III. Education and Experience • Master’s or higher degree (may be substituted by relevant work experience) in applied mathematics, physics, computer science, engineering, statistics or a related field • 3+ years of Mathematical Optimization, Machine Learning, Predictive Analytics and Algorithm Development experience (experience with tools such as WEKA, RapidMiner, R. Python or other open source tools strongly desired) • Strong development skills in two or more of the following: C/C++, C#, Python, Java • Combining analytic methods with advanced data visualizations • Expert ability to breakdown and clearly define problems • Experience with Natural Language Processing preferred IV. Knowledge, Skills and Abilities • Proven communications skills – Effective at working independently and in collaboration with other staff members. Capable of clearly presenting findings orally, in writing, or through graphics. • Proven analytical skills – Able to compare, contrast, and validate work with keen attention to detail. Skilled in working with “real world” data including scrubbing, transformation, and imputation. • Proven problem solving skills – Able to plan work, set clear direction, and coordinate own tasks in a fast-paced multidisciplinary environment. Expert at triaging issues, identifying data anomalies, and debugging software. • Design and prototype new application functionality for our products. • Change oriented – actively generates process improvements; supports and drives change, and confronts difficult circumstances in creative ways • Effective communicator and change agent • Ability to prioritize the tasks of the project timeline to achieve the desired results • Strong analytic and problem solving skills • Ability to cooperatively and effectively work with people from various organization levels We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.
3rd rowKnowBe4, Inc. is a high growth information security company. We are the world's largest provider of new-school security awareness training and simulated phishing. KnowBe4 was created to help organizations manage the ongoing problem of social engineering. Tens of thousands of organizations worldwide use KnowBe4's platform to mobilize their end users as a last line of defense and enable them to make better security decisions, every day. We are ranked #1 best place to work in technology nationwide by Fortune Magazine and have placed #1 or #2 in The Tampa Bay Top Workplaces Survey for the last four years. We also just had our 27th record-setting quarter in a row! The Data Scientist will work closely with the VP of FP&A and the Quantitative Analytics Manager to implement advanced analytical models and other data-driven solutions. Responsibilities: Work with key stakeholders throughout the organization to identify opportunities using financial data to develop business solutions. Develop new and enhance existing data collection procedures to ensure that all data relevant for analyses is captured. Cleanse, consolidate, and verify the integrity of data used in analyses. Build and validate predictive models to increase customer retention, revenue generation, and other business outcomes. Develop relevant statistical models to assist with profitability forecasting Create the analytics to leverage known, inferred and appended information about origins and recognizing patterns to assist in outlook forecasting Assist in the design and data modeling of data warehouse. Visualize data, especially in reports and dashboards, to communicate analysis results to stakeholders. Extend data collection to unstructured data within the company and external sources Mine and collect data (both structured and unstructured) to detect patterns, opportunities and insights that drive our organization Create and execute automation and data mining requests utilizing SQL, Access, Excel, SAS and other statistical programs Trouble shoot forecast and optimization anomalies with FP&A team through the use of statistical and mathematical optimization models. Develop testing to explain and or reduce these anomalies. Oversee and develop key metric forecasts as well as provide budget support based on trends in the business/industry. Minimum Qualifications: Master's degree in Statistics, Computer Science, Mathematics or other quantitative discipline required 2-3 years of experience in similar role (Master's Degree) 0-2 years of experience in similar role (PhD) Experience leveraging predictive modeling, big data analytics, exploratory data analysis and machine learning to drive significant business impact Experience with statistical computer languages (Python, R etc.) to manipulate and analyze large datasets preferred. Experience with data visualization tools like D3.js, matplotlib, etc., preferred Exceptional understanding of machine learning algorithms such as Random Forest, SVM, k-NN, Naïve Bayes, Gradient Boosting a plus. Applied statistical skills including statistical testing, regression, etc. Experience with data bases, query languages, and associated data architecture. Experience with distributed computing tools (Hive, Spark, etc.) is a plus. Strong analytical skills and ability to meet project deadlines. Note: An applicant assessment, background check and drug test may be part of your hiring procedure. No recruitment agencies, please.
4th row*Organization and Job ID** Job ID: 310709 Directorate: Earth & Biological Sciences Division: Biological Sciences Group: Exposure Science Team *Job Description** The Biological System Science (BSS) Group in the Biological Sciences Division of the Pacific Northwest National Laboratory (PNNL) is seeking a staff scientist with multidisciplinary experience in computational chemistry, cheminformatics, advanced statistics and/or machine learning/deep learning/AI. Preferred candidates will have a broad understanding of the state of computational metabolomics and experience in designing and implementing novel deep learning networks for chemistry applications. Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification is also highly valued. Successful candidates will join a large, uniquely collaborative, collegial group of innovators driving the integration of data science, computational science and analytical chemistry to solve the nations most challenging problems in human health, chemical forensics, and national security. The BSS Group is diverse and inclusive, working closely with colleagues across the laboratory with expertise in computational biology, integrative omics, applied mathematics, computer science, and statistics. + Apply knowledge of statistics, machine learning, advanced mathematics, simulation, software development, and data modeling to to design, development and implement methods that integrate, clean and analyze data, recognize patterns, address uncertainty, pose questions, and make discoveries from structured and/or unstructured data. + Produce solutions driven by exploratory data analysis from complex and high-dimensional datasets. + Design, develop, and evaluate predictive models and advanced algorithms that lead to optimal value extraction from data. + Develop and maintain existing deep learning networks that generate novel molecules for drug discovery applications + Contribue or author proposals, peer-reviewed papers, and other technical products. *Minimum Qualifications** BS/BA with 0-1 years of experience or MS/MA with 0-1 years of experience *Preferred Qualifications** + MS in chemical engineering, computer science, or related field with a GPA of 3.5+ 5+ years of research experience + Intermediate level programming experience (preferably Python) and high-performance computing experience + At least one first author published, or proof of submitted, paper applying deep learning for use in novel compound generation + Understanding of the NMDA receptor and potential drug targets + Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification *Equal Employment Opportunity** Battelle Memorial Institute (BMI) at Pacific Northwest National Laboratory (PNNL) is an Affirmative Action/Equal Opportunity Employer and supports diversity in the workplace. All employment decisions are made without regard to race, color, religion, sex, national origin, age, disability, veteran status, marital or family status, sexual orientation, gender identity, or genetic information. All BMI staff must be able to demonstrate the legal right to work in the United States. BMI is an E-Verify employer. Learn more at jobs.pnnl.gov. *_Please be aware that the Department of Energy (DOE) prohibits DOE employees and contractors from participation in certain foreign government talent recruitment programs. If you are offered a position at PNNL and are currently a participant in a foreign government talent recruitment program you will be required to disclose this information before your first day of employment._** _Directorate:_ _Earth & Biological Sciences_ _Job Category:_ _Scientists/Scientific Support_ _Group:_ _Biological Systems Science_ _Opening Date:_ _2020-03-26_ _Closing Date:_ _2020-04-05_
5th rowData Scientist Affinity Solutions / Marketing Cloud seeks smart, curious, technically savvy candidates to join our cutting-edge data science team. We hire the best and brightest and give them the opportunity to work on industry-leading technologies. The data sciences team at AFS/Marketing Cloud build models, machine learning algorithms that power all our ad-tech/mar-tech products at scale, develop methodology and tools to precisely and effectively measure market campaign effects, and research in-house and public data sources for consumer spend behavior insights. In this role, you'll have the opportunity to come up with new ideas and solutions that will lead to improvement of our ability to target the right audience, derive insights and provide better measurement methodology for marketing campaigns. You'll access our core data asset and machine learning infrastructure to power your ideas. Duties and Responsibilities · Support all clients model building needs, including maintaining and improving current modeling/scoring methodology and processes, · Provide innovative solutions to customized modeling/scoring/targeting with appropriate ML/statistical tools, · Provide analytical/statistical support such as marketing test design, projection, campaign measurement, market insights to clients and stakeholders. · Mine large consumer datasets in the cloud environment to support ad hoc business and statistical analysis, · Develop and Improve automation capabilities to enable customized delivery of the analytical products to clients, · Communicate the methodologies and the results to the management, clients and none technical stakeholders. Basic Qualifications · Advanced degree in Statistics/Mathematics/Computer Science/Economics or other fields that requires advanced training in data analytics. · Being able to apply basic statistical/ML concepts and reasoning to address and solve business problems such as targeting, test design, KPI projection and performance measurement. · Entrepreneurial, highly self-motivated, collaborative, keen attention to detail, willingness and capable learn quickly, and ability to effectively prioritize and execute tasks in a high pressure environment. · Being flexible to accept different task assignments and able to work on a tight time schedule. · Excellent command of one or more programming languages; preferably Python, SAS or R · Familiar with one of the database technologies such as PostgreSQL, MySQL, can write basic SQL queries · Great communication skills (verbal, written and presentation) Preferred Qualifications · Experience or exposure to large consumer and/or demographic data sets. · Familiarity with data manipulation and cleaning routines and techniques.
ValueCountFrequency (%)
and 29770
 
5.9%
to 16076
 
3.2%
the 12450
 
2.5%
of 12013
 
2.4%
data 9615
 
1.9%
in 8900
 
1.8%
a 8163
 
1.6%
with 7478
 
1.5%
for 5494
 
1.1%
experience 4747
 
0.9%
Other values (14720) 391241
77.3%
2024-07-11T11:56:07.813277image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
475725
13.2%
e 338051
 
9.4%
i 249004
 
6.9%
a 246924
 
6.9%
t 243141
 
6.8%
n 236132
 
6.6%
o 210528
 
5.9%
s 186368
 
5.2%
r 185829
 
5.2%
l 132874
 
3.7%
Other values (113) 1092299
30.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3596875
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
475725
13.2%
e 338051
 
9.4%
i 249004
 
6.9%
a 246924
 
6.9%
t 243141
 
6.8%
n 236132
 
6.6%
o 210528
 
5.9%
s 186368
 
5.2%
r 185829
 
5.2%
l 132874
 
3.7%
Other values (113) 1092299
30.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3596875
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
475725
13.2%
e 338051
 
9.4%
i 249004
 
6.9%
a 246924
 
6.9%
t 243141
 
6.8%
n 236132
 
6.6%
o 210528
 
5.9%
s 186368
 
5.2%
r 185829
 
5.2%
l 132874
 
3.7%
Other values (113) 1092299
30.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3596875
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
475725
13.2%
e 338051
 
9.4%
i 249004
 
6.9%
a 246924
 
6.9%
t 243141
 
6.8%
n 236132
 
6.6%
o 210528
 
5.9%
s 186368
 
5.2%
r 185829
 
5.2%
l 132874
 
3.7%
Other values (113) 1092299
30.4%

Rating
Real number (ℝ)

Distinct32
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.6012552
Minimum-1
Maximum5
Zeros0
Zeros (%)0.0%
Negative34
Negative (%)3.6%
Memory size7.6 KiB
2024-07-11T11:56:07.901605image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile2.3
Q13.3
median3.8
Q34.2
95-th percentile4.7
Maximum5
Range6
Interquartile range (IQR)0.9

Descriptive statistics

Standard deviation1.0676188
Coefficient of variation (CV)0.29645742
Kurtosis9.6793285
Mean3.6012552
Median Absolute Deviation (MAD)0.4
Skewness-2.7421713
Sum3442.8
Variance1.1398099
MonotonicityNot monotonic
2024-07-11T11:56:07.976138image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
3.8 75
 
7.8%
3.7 69
 
7.2%
3.9 69
 
7.2%
3.6 56
 
5.9%
4 56
 
5.9%
3.5 53
 
5.5%
3.4 47
 
4.9%
4.4 46
 
4.8%
3.3 44
 
4.6%
4.2 41
 
4.3%
Other values (22) 400
41.8%
ValueCountFrequency (%)
-1 34
3.6%
1.9 3
 
0.3%
2.1 5
 
0.5%
2.2 3
 
0.3%
2.3 4
 
0.4%
2.4 8
 
0.8%
2.5 3
 
0.3%
2.6 14
1.5%
2.7 17
1.8%
2.8 7
 
0.7%
ValueCountFrequency (%)
5 28
2.9%
4.9 4
 
0.4%
4.8 14
 
1.5%
4.7 38
4.0%
4.6 18
 
1.9%
4.5 19
2.0%
4.4 46
4.8%
4.3 39
4.1%
4.2 41
4.3%
4.1 37
3.9%
Distinct448
Distinct (%)46.9%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:08.346675image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length55
Median length39
Mean length19.302301
Min length4

Characters and Unicode

Total characters18453
Distinct characters75
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)23.0%

Sample

1st rowTecolote Research 3.8
2nd rowUniversity of Maryland Medical System 3.4
3rd rowKnowBe4 4.8
4th rowPNNL 3.8
5th rowAffinity Solutions 2.9
ValueCountFrequency (%)
3.8 75
 
2.6%
3.7 69
 
2.4%
3.9 69
 
2.4%
3.6 56
 
1.9%
4.0 56
 
1.9%
3.5 53
 
1.8%
3.4 47
 
1.6%
4.4 46
 
1.6%
3.3 44
 
1.5%
4.2 41
 
1.4%
Other values (676) 2337
80.8%
2024-07-11T11:56:08.778453image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 1322
 
7.2%
a 1118
 
6.1%
1015
 
5.5%
. 978
 
5.3%
922
 
5.0%
n 912
 
4.9%
t 900
 
4.9%
i 891
 
4.8%
r 839
 
4.5%
o 832
 
4.5%
Other values (65) 8724
47.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 18453
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1322
 
7.2%
a 1118
 
6.1%
1015
 
5.5%
. 978
 
5.3%
922
 
5.0%
n 912
 
4.9%
t 900
 
4.9%
i 891
 
4.8%
r 839
 
4.5%
o 832
 
4.5%
Other values (65) 8724
47.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 18453
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1322
 
7.2%
a 1118
 
6.1%
1015
 
5.5%
. 978
 
5.3%
922
 
5.0%
n 912
 
4.9%
t 900
 
4.9%
i 891
 
4.8%
r 839
 
4.5%
o 832
 
4.5%
Other values (65) 8724
47.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 18453
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1322
 
7.2%
a 1118
 
6.1%
1015
 
5.5%
. 978
 
5.3%
922
 
5.0%
n 912
 
4.9%
t 900
 
4.9%
i 891
 
4.8%
r 839
 
4.5%
o 832
 
4.5%
Other values (65) 8724
47.3%
Distinct237
Distinct (%)24.8%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:09.100042image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length33
Median length22
Mean length13.17887
Min length6

Characters and Unicode

Total characters12599
Distinct characters54
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)9.3%

Sample

1st rowAlbuquerque, NM
2nd rowLinthicum, MD
3rd rowClearwater, FL
4th rowRichland, WA
5th rowNew York, NY
ValueCountFrequency (%)
ca 211
 
9.4%
ma 124
 
5.5%
san 121
 
5.4%
ny 96
 
4.3%
francisco 89
 
4.0%
new 82
 
3.7%
york 78
 
3.5%
cambridge 60
 
2.7%
va 56
 
2.5%
il 48
 
2.1%
Other values (293) 1280
57.0%
2024-07-11T11:56:09.520252image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1289
 
10.2%
, 947
 
7.5%
a 809
 
6.4%
o 688
 
5.5%
n 688
 
5.5%
e 653
 
5.2%
i 630
 
5.0%
A 578
 
4.6%
r 557
 
4.4%
l 437
 
3.5%
Other values (44) 5323
42.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 12599
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1289
 
10.2%
, 947
 
7.5%
a 809
 
6.4%
o 688
 
5.5%
n 688
 
5.5%
e 653
 
5.2%
i 630
 
5.0%
A 578
 
4.6%
r 557
 
4.4%
l 437
 
3.5%
Other values (44) 5323
42.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 12599
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1289
 
10.2%
, 947
 
7.5%
a 809
 
6.4%
o 688
 
5.5%
n 688
 
5.5%
e 653
 
5.2%
i 630
 
5.0%
A 578
 
4.6%
r 557
 
4.4%
l 437
 
3.5%
Other values (44) 5323
42.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 12599
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1289
 
10.2%
, 947
 
7.5%
a 809
 
6.4%
o 688
 
5.5%
n 688
 
5.5%
e 653
 
5.2%
i 630
 
5.0%
A 578
 
4.6%
r 557
 
4.4%
l 437
 
3.5%
Other values (44) 5323
42.2%
Distinct235
Distinct (%)24.6%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:09.814130image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length26
Median length22
Mean length13.605649
Min length2

Characters and Unicode

Total characters13007
Distinct characters55
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)8.7%

Sample

1st rowGoleta, CA
2nd rowBaltimore, MD
3rd rowClearwater, FL
4th rowRichland, WA
5th rowNew York, NY
ValueCountFrequency (%)
ca 223
 
9.7%
san 103
 
4.5%
ma 101
 
4.4%
ny 86
 
3.7%
new 78
 
3.4%
york 75
 
3.2%
va 72
 
3.1%
francisco 65
 
2.8%
il 42
 
1.8%
pa 36
 
1.6%
Other values (297) 1427
61.8%
2024-07-11T11:56:10.217291image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1352
 
10.4%
, 945
 
7.3%
a 858
 
6.6%
n 770
 
5.9%
e 715
 
5.5%
o 673
 
5.2%
i 632
 
4.9%
A 579
 
4.5%
r 547
 
4.2%
l 494
 
3.8%
Other values (45) 5442
41.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 13007
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1352
 
10.4%
, 945
 
7.3%
a 858
 
6.6%
n 770
 
5.9%
e 715
 
5.5%
o 673
 
5.2%
i 632
 
4.9%
A 579
 
4.5%
r 547
 
4.2%
l 494
 
3.8%
Other values (45) 5442
41.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 13007
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1352
 
10.4%
, 945
 
7.3%
a 858
 
6.6%
n 770
 
5.9%
e 715
 
5.5%
o 673
 
5.2%
i 632
 
4.9%
A 579
 
4.5%
r 547
 
4.2%
l 494
 
3.8%
Other values (45) 5442
41.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 13007
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1352
 
10.4%
, 945
 
7.3%
a 858
 
6.6%
n 770
 
5.9%
e 715
 
5.5%
o 673
 
5.2%
i 632
 
4.9%
A 579
 
4.5%
r 547
 
4.2%
l 494
 
3.8%
Other values (45) 5442
41.8%

Size
Categorical

Distinct9
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
1001 to 5000 employees
177 
201 to 500 employees
160 
51 to 200 employees
155 
10000+ employees
154 
501 to 1000 employees
144 
Other values (4)
166 

Length

Max length23
Median length21
Mean length19.359833
Min length2

Characters and Unicode

Total characters18508
Distinct characters19
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row501 to 1000 employees
2nd row10000+ employees
3rd row501 to 1000 employees
4th row1001 to 5000 employees
5th row51 to 200 employees

Common Values

ValueCountFrequency (%)
1001 to 5000 employees 177
18.5%
201 to 500 employees 160
16.7%
51 to 200 employees 155
16.2%
10000+ employees 154
16.1%
501 to 1000 employees 144
15.1%
5001 to 10000 employees 79
8.3%
1 to 50 employees 61
 
6.4%
Unknown 15
 
1.6%
-1 11
 
1.2%

Length

2024-07-11T11:56:10.307005image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-11T11:56:10.372933image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
ValueCountFrequency (%)
employees 930
27.1%
to 776
22.6%
10000 233
 
6.8%
1001 177
 
5.1%
5000 177
 
5.1%
201 160
 
4.7%
500 160
 
4.7%
51 155
 
4.5%
200 155
 
4.5%
501 144
 
4.2%
Other values (5) 371
 
10.8%

Most occurring characters

ValueCountFrequency (%)
0 3402
18.4%
e 2790
15.1%
2482
13.4%
o 1721
9.3%
1 1341
 
7.2%
p 930
 
5.0%
s 930
 
5.0%
y 930
 
5.0%
l 930
 
5.0%
m 930
 
5.0%
Other values (9) 2122
11.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 18508
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 3402
18.4%
e 2790
15.1%
2482
13.4%
o 1721
9.3%
1 1341
 
7.2%
p 930
 
5.0%
s 930
 
5.0%
y 930
 
5.0%
l 930
 
5.0%
m 930
 
5.0%
Other values (9) 2122
11.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 18508
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 3402
18.4%
e 2790
15.1%
2482
13.4%
o 1721
9.3%
1 1341
 
7.2%
p 930
 
5.0%
s 930
 
5.0%
y 930
 
5.0%
l 930
 
5.0%
m 930
 
5.0%
Other values (9) 2122
11.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 18508
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 3402
18.4%
e 2790
15.1%
2482
13.4%
o 1721
9.3%
1 1341
 
7.2%
p 930
 
5.0%
s 930
 
5.0%
y 930
 
5.0%
l 930
 
5.0%
m 930
 
5.0%
Other values (9) 2122
11.5%

Founded
Real number (ℝ)

Distinct109
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1774.6056
Minimum-1
Maximum2019
Zeros0
Zeros (%)0.0%
Negative97
Negative (%)10.1%
Memory size7.6 KiB
2024-07-11T11:56:10.474382image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile-1
Q11937
median1992
Q32008
95-th percentile2015
Maximum2019
Range2020
Interquartile range (IQR)71

Descriptive statistics

Standard deviation598.94252
Coefficient of variation (CV)0.33750739
Kurtosis4.9000164
Mean1774.6056
Median Absolute Deviation (MAD)20
Skewness-2.6126243
Sum1696523
Variance358732.14
MonotonicityNot monotonic
2024-07-11T11:56:10.587149image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-1 97
 
10.1%
2008 40
 
4.2%
2010 38
 
4.0%
1996 38
 
4.0%
2013 33
 
3.5%
2006 26
 
2.7%
2002 25
 
2.6%
2012 24
 
2.5%
2007 22
 
2.3%
2011 22
 
2.3%
Other values (99) 591
61.8%
ValueCountFrequency (%)
-1 97
10.1%
1744 1
 
0.1%
1781 14
 
1.5%
1812 1
 
0.1%
1830 4
 
0.4%
1846 2
 
0.2%
1849 7
 
0.7%
1850 1
 
0.1%
1851 14
 
1.5%
1852 5
 
0.5%
ValueCountFrequency (%)
2019 4
 
0.4%
2018 2
 
0.2%
2017 15
 
1.6%
2016 15
 
1.6%
2015 21
2.2%
2014 21
2.2%
2013 33
3.5%
2012 24
2.5%
2011 22
2.3%
2010 38
4.0%
Distinct13
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
Company - Private
532 
Company - Public
237 
Nonprofit Organization
65 
Subsidiary or Business Segment
 
40
Government
 
17
Other values (8)
65 

Length

Max length30
Median length17
Mean length17.108787
Min length2

Characters and Unicode

Total characters16356
Distinct characters38
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowCompany - Private
2nd rowOther Organization
3rd rowCompany - Private
4th rowGovernment
5th rowCompany - Private

Common Values

ValueCountFrequency (%)
Company - Private 532
55.6%
Company - Public 237
24.8%
Nonprofit Organization 65
 
6.8%
Subsidiary or Business Segment 40
 
4.2%
Government 17
 
1.8%
Hospital 15
 
1.6%
College / University 15
 
1.6%
Unknown 11
 
1.2%
-1 11
 
1.2%
Other Organization 5
 
0.5%
Other values (3) 8
 
0.8%

Length

2024-07-11T11:56:10.713850image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
787
28.9%
company 769
28.2%
private 533
19.6%
public 237
 
8.7%
organization 70
 
2.6%
nonprofit 65
 
2.4%
subsidiary 40
 
1.5%
or 40
 
1.5%
business 40
 
1.5%
segment 40
 
1.5%
Other values (12) 102
 
3.7%

Most occurring characters

ValueCountFrequency (%)
1767
 
10.8%
a 1503
 
9.2%
i 1146
 
7.0%
n 1141
 
7.0%
o 1080
 
6.6%
p 849
 
5.2%
m 827
 
5.1%
y 824
 
5.0%
r 794
 
4.9%
C 789
 
4.8%
Other values (28) 5636
34.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 16356
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1767
 
10.8%
a 1503
 
9.2%
i 1146
 
7.0%
n 1141
 
7.0%
o 1080
 
6.6%
p 849
 
5.2%
m 827
 
5.1%
y 824
 
5.0%
r 794
 
4.9%
C 789
 
4.8%
Other values (28) 5636
34.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 16356
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1767
 
10.8%
a 1503
 
9.2%
i 1146
 
7.0%
n 1141
 
7.0%
o 1080
 
6.6%
p 849
 
5.2%
m 827
 
5.1%
y 824
 
5.0%
r 794
 
4.9%
C 789
 
4.8%
Other values (28) 5636
34.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 16356
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1767
 
10.8%
a 1503
 
9.2%
i 1146
 
7.0%
n 1141
 
7.0%
o 1080
 
6.6%
p 849
 
5.2%
m 827
 
5.1%
y 824
 
5.0%
r 794
 
4.9%
C 789
 
4.8%
Other values (28) 5636
34.5%
Distinct63
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:10.926812image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length40
Median length35
Mean length21.130753
Min length2

Characters and Unicode

Total characters20201
Distinct characters52
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)1.4%

Sample

1st rowAerospace & Defense
2nd rowHealth Care Services & Hospitals
3rd rowSecurity Services
4th rowEnergy
5th rowAdvertising & Marketing
ValueCountFrequency (%)
532
19.7%
services 150
 
5.6%
biotech 148
 
5.5%
pharmaceuticals 148
 
5.5%
software 126
 
4.7%
it 77
 
2.8%
insurance 71
 
2.6%
computer 70
 
2.6%
hardware 70
 
2.6%
carriers 65
 
2.4%
Other values (105) 1245
46.1%
2024-07-11T11:56:11.251527image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 2143
 
10.6%
1746
 
8.6%
r 1600
 
7.9%
a 1469
 
7.3%
t 1319
 
6.5%
i 1258
 
6.2%
s 1177
 
5.8%
n 1049
 
5.2%
c 955
 
4.7%
o 952
 
4.7%
Other values (42) 6533
32.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 20201
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 2143
 
10.6%
1746
 
8.6%
r 1600
 
7.9%
a 1469
 
7.3%
t 1319
 
6.5%
i 1258
 
6.2%
s 1177
 
5.8%
n 1049
 
5.2%
c 955
 
4.7%
o 952
 
4.7%
Other values (42) 6533
32.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 20201
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 2143
 
10.6%
1746
 
8.6%
r 1600
 
7.9%
a 1469
 
7.3%
t 1319
 
6.5%
i 1258
 
6.2%
s 1177
 
5.8%
n 1049
 
5.2%
c 955
 
4.7%
o 952
 
4.7%
Other values (42) 6533
32.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 20201
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 2143
 
10.6%
1746
 
8.6%
r 1600
 
7.9%
a 1469
 
7.3%
t 1319
 
6.5%
i 1258
 
6.2%
s 1177
 
5.8%
n 1049
 
5.2%
c 955
 
4.7%
o 952
 
4.7%
Other values (42) 6533
32.3%

Sector
Categorical

Distinct25
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
Information Technology
239 
Biotech & Pharmaceuticals
148 
Business Services
134 
Insurance
71 
Finance
56 
Other values (20)
308 

Length

Max length34
Median length28
Mean length16.828452
Min length2

Characters and Unicode

Total characters16088
Distinct characters42
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st rowAerospace & Defense
2nd rowHealth Care
3rd rowBusiness Services
4th rowOil, Gas, Energy & Utilities
5th rowBusiness Services

Common Values

ValueCountFrequency (%)
Information Technology 239
25.0%
Biotech & Pharmaceuticals 148
15.5%
Business Services 134
14.0%
Insurance 71
 
7.4%
Finance 56
 
5.9%
Health Care 51
 
5.3%
Manufacturing 40
 
4.2%
-1 39
 
4.1%
Aerospace & Defense 32
 
3.3%
Education 26
 
2.7%
Other values (15) 120
12.6%

Length

2024-07-11T11:56:11.342146image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
information 239
12.7%
technology 239
12.7%
224
11.9%
biotech 148
 
7.9%
pharmaceuticals 148
 
7.9%
services 138
 
7.3%
business 134
 
7.1%
insurance 71
 
3.8%
finance 56
 
3.0%
health 51
 
2.7%
Other values (34) 430
22.9%

Most occurring characters

ValueCountFrequency (%)
e 1493
 
9.3%
n 1376
 
8.6%
o 1269
 
7.9%
a 1164
 
7.2%
i 1106
 
6.9%
c 1081
 
6.7%
922
 
5.7%
s 915
 
5.7%
r 823
 
5.1%
t 811
 
5.0%
Other values (32) 5128
31.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 16088
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1493
 
9.3%
n 1376
 
8.6%
o 1269
 
7.9%
a 1164
 
7.2%
i 1106
 
6.9%
c 1081
 
6.7%
922
 
5.7%
s 915
 
5.7%
r 823
 
5.1%
t 811
 
5.0%
Other values (32) 5128
31.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 16088
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1493
 
9.3%
n 1376
 
8.6%
o 1269
 
7.9%
a 1164
 
7.2%
i 1106
 
6.9%
c 1081
 
6.7%
922
 
5.7%
s 915
 
5.7%
r 823
 
5.1%
t 811
 
5.0%
Other values (32) 5128
31.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 16088
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1493
 
9.3%
n 1376
 
8.6%
o 1269
 
7.9%
a 1164
 
7.2%
i 1106
 
6.9%
c 1081
 
6.7%
922
 
5.7%
s 915
 
5.7%
r 823
 
5.1%
t 811
 
5.0%
Other values (32) 5128
31.9%

Revenue
Categorical

Distinct14
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
Unknown / Non-Applicable
299 
$10+ billion (USD)
140 
$100 to $500 million (USD)
107 
$1 to $2 billion (USD)
68 
$500 million to $1 billion (USD)
62 
Other values (9)
280 

Length

Max length32
Median length26
Mean length23.362971
Min length2

Characters and Unicode

Total characters22335
Distinct characters32
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row$50 to $100 million (USD)
2nd row$2 to $5 billion (USD)
3rd row$100 to $500 million (USD)
4th row$500 million to $1 billion (USD)
5th rowUnknown / Non-Applicable

Common Values

ValueCountFrequency (%)
Unknown / Non-Applicable 299
31.3%
$10+ billion (USD) 140
14.6%
$100 to $500 million (USD) 107
 
11.2%
$1 to $2 billion (USD) 68
 
7.1%
$500 million to $1 billion (USD) 62
 
6.5%
$25 to $50 million (USD) 59
 
6.2%
$50 to $100 million (USD) 52
 
5.4%
$2 to $5 billion (USD) 44
 
4.6%
$10 to $25 million (USD) 39
 
4.1%
$5 to $10 million (USD) 29
 
3.0%
Other values (4) 57
 
6.0%

Length

2024-07-11T11:56:11.420272image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
usd 646
16.5%
to 497
12.7%
million 374
9.5%
billion 334
8.5%
unknown 299
7.6%
299
7.6%
non-applicable 299
7.6%
10 228
 
5.8%
500 169
 
4.3%
1 167
 
4.3%
Other values (7) 608
15.5%

Most occurring characters

ValueCountFrequency (%)
2964
 
13.3%
l 2014
 
9.0%
n 1913
 
8.6%
o 1803
 
8.1%
i 1715
 
7.7%
$ 1143
 
5.1%
0 995
 
4.5%
U 945
 
4.2%
( 646
 
2.9%
S 646
 
2.9%
Other values (22) 7551
33.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 22335
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2964
 
13.3%
l 2014
 
9.0%
n 1913
 
8.6%
o 1803
 
8.1%
i 1715
 
7.7%
$ 1143
 
5.1%
0 995
 
4.5%
U 945
 
4.2%
( 646
 
2.9%
S 646
 
2.9%
Other values (22) 7551
33.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 22335
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2964
 
13.3%
l 2014
 
9.0%
n 1913
 
8.6%
o 1803
 
8.1%
i 1715
 
7.7%
$ 1143
 
5.1%
0 995
 
4.5%
U 945
 
4.2%
( 646
 
2.9%
S 646
 
2.9%
Other values (22) 7551
33.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 22335
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2964
 
13.3%
l 2014
 
9.0%
n 1913
 
8.6%
o 1803
 
8.1%
i 1715
 
7.7%
$ 1143
 
5.1%
0 995
 
4.5%
U 945
 
4.2%
( 646
 
2.9%
S 646
 
2.9%
Other values (22) 7551
33.8%
Distinct149
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2024-07-11T11:56:11.735751image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length92
Median length2
Mean length14.25
Min length2

Characters and Unicode

Total characters13623
Distinct characters65
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)6.9%

Sample

1st row-1
2nd row-1
3rd row-1
4th rowOak Ridge National Laboratory, National Renewable Energy Lab, Los Alamos National Laboratory
5th rowCommerce Signals, Cardlytics, Yodlee
ValueCountFrequency (%)
1 634
29.3%
national 44
 
2.0%
laboratory 28
 
1.3%
novartis 25
 
1.2%
group 21
 
1.0%
pfizer 20
 
0.9%
17
 
0.8%
glaxosmithkline 17
 
0.8%
international 17
 
0.8%
los 17
 
0.8%
Other values (491) 1323
61.2%
2024-07-11T11:56:12.138692image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1207
 
8.9%
e 1065
 
7.8%
a 903
 
6.6%
o 743
 
5.5%
t 741
 
5.4%
i 697
 
5.1%
r 693
 
5.1%
- 643
 
4.7%
n 643
 
4.7%
1 634
 
4.7%
Other values (55) 5654
41.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 13623
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1207
 
8.9%
e 1065
 
7.8%
a 903
 
6.6%
o 743
 
5.5%
t 741
 
5.4%
i 697
 
5.1%
r 693
 
5.1%
- 643
 
4.7%
n 643
 
4.7%
1 634
 
4.7%
Other values (55) 5654
41.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 13623
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1207
 
8.9%
e 1065
 
7.8%
a 903
 
6.6%
o 743
 
5.5%
t 741
 
5.4%
i 697
 
5.1%
r 693
 
5.1%
- 643
 
4.7%
n 643
 
4.7%
1 634
 
4.7%
Other values (55) 5654
41.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 13623
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1207
 
8.9%
e 1065
 
7.8%
a 903
 
6.6%
o 743
 
5.5%
t 741
 
5.4%
i 697
 
5.1%
r 693
 
5.1%
- 643
 
4.7%
n 643
 
4.7%
1 634
 
4.7%
Other values (55) 5654
41.5%

Interactions

2024-07-11T11:56:04.736237image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.307481image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.515767image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.791556image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.362293image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.587930image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.874610image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.438675image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-07-11T11:56:04.652400image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Missing values

2024-07-11T11:56:04.981546image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
A simple visualization of nullity by column.
2024-07-11T11:56:05.137077image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Unnamed: 0Job TitleSalary EstimateJob DescriptionRatingCompany NameLocationHeadquartersSizeFoundedType of ownershipIndustrySectorRevenueCompetitors
00Data Scientist$53K-$91K (Glassdoor est.)Data Scientist\nLocation: Albuquerque, NM\nEducation Required: Bachelor’s degree required, preferably in math, engineering, business, or the sciences.\nSkills Required:\nBachelor’s Degree in relevant field, e.g., math, data analysis, database, computer science, Artificial Intelligence (AI); three years’ experience credit for Master’s degree; five years’ experience credit for a Ph.D\nApplicant should be proficient in the use of Power BI, Tableau, Python, MATLAB, Microsoft Word, PowerPoint, Excel, and working knowledge of MS Access, LMS, SAS, data visualization tools, and have a strong algorithmic aptitude\nExcellent verbal and written communication skills, and quantitative analytical skills are required\nApplicant must be able to work in a team environment\nU.S. citizenship and ability to obtain a DoD Secret Clearance required\nResponsibilities: The applicant will be responsible for formulating analytical solutions to complex data problems; creating data analytic models to improve data metrics; analyzing customer behavior and trends; delivering insights to stakeholders, as well as designing and crafting reports, dashboards, models, and algorithms to make data insights actionable; selecting features, building and optimizing classifiers using machine learning techniques; data mining using state-of-the-art methods, extending organization’s data with third party sources of information when needed; enhancing data collection procedures to include information that is relevant for building analytic systems; processing, cleansing, and verifying the integrity of data used for analysis; doing ad-hoc analysis and presenting results in a clear manner; and creating automated anomaly detection systems and constant tracking of its performance.\nBenefits:\nWe offer competitive salaries commensurate with education and experience. We have an excellent benefits package that includes:\nComprehensive health, dental, life, long and short term disability insurance\n100% Company funded Retirement Plans\nGenerous vacation, holiday and sick pay plans\nTuition assistance\n\nBenefits are provided to employees regularly working a minimum of 30 hours per week.\n\nTecolote Research is a private, employee-owned corporation where people are our primary resource. Our investments in technology and training give our employees the tools to ensure our clients are provided the solutions they need, and our very high employee retention rate and stable workforce is an added value to our customers. Apply now to connect with a company that invests in you.3.8Tecolote Research\n3.8Albuquerque, NMGoleta, CA501 to 1000 employees1973Company - PrivateAerospace & DefenseAerospace & Defense$50 to $100 million (USD)-1
11Healthcare Data Scientist$63K-$112K (Glassdoor est.)What You Will Do:\n\nI. General Summary\n\nThe Healthcare Data Scientist position will join our Advanced Analytics group at the University of Maryland Medical System (UMMS) in support of its strategic priority to become a data-driven and outcomes-oriented organization. The successful candidate will have 3+ years of experience with Machine Learning, Predictive Modeling, Statistical Analysis, Mathematical Optimization, Algorithm Development and a passion for working with healthcare data. Previous experience with various computational approaches along with an ability to demonstrate a portfolio of relevant prior projects is essential. This position will report to the UMMS Vice President for Enterprise Data and Analytics (ED&A).\n\nII. Principal Responsibilities and Tasks\n\n• Develops predictive and prescriptive analytic models in support of the organization’s clinical, operations and business initiatives and priorities.\n• Deploys solutions so that they provide actionable insights to the organization and are embedded or integrated with application systems\n• Supports and drives analytic efforts designed around organization’s strategic priorities and clinical/business problems\n• Works in a team to drive disruptive innovation, which may translate into improved quality of care, clinical outcomes, reduced costs, temporal efficiencies and process improvements.\n• Builds and extends our analytics portfolio supported by robust documentation\n• Works with autonomy to find solutions to complex problems using open source tools and in-house development\n• Stays abreast of state-of-the-art literature in the fields of operations research, statistical modeling, statistical process control and mathematical optimization\n• Creates, communicates, and manages the project plans and other required project documentation and provides updates to leadership as necessary\n• Develops and maintains relationships with business, IT and clinical leaders and stakeholders across the enterprise to facilitate collaboration and effective communication\n• Works with the analytics team and clinical/business stakeholders to develop pilots so that they may be tested and validated in pilot settings\n• Performs analysis to evaluate primary and secondary objectives from such pilots\n• Assists leadership with strategies for scaling successful projects across the organization and enhances the analytics applications based on feedback from end-users and clinical/business consumers\n• Assists leadership with dissemination of success stories (and failures) in an effort to increase analytics literacy and adoption across the organization.\n\nWhat You Need to Be Successful:\n\nIII. Education and Experience\n\n• Master’s or higher degree (may be substituted by relevant work experience) in applied mathematics, physics, computer science, engineering, statistics or a related field\n• 3+ years of Mathematical Optimization, Machine Learning, Predictive Analytics and Algorithm Development experience (experience with tools such as WEKA, RapidMiner, R. Python or other open source tools strongly desired)\n• Strong development skills in two or more of the following: C/C++, C#, Python, Java\n• Combining analytic methods with advanced data visualizations\n• Expert ability to breakdown and clearly define problems\n• Experience with Natural Language Processing preferred\n\nIV. Knowledge, Skills and Abilities\n\n• Proven communications skills – Effective at working independently and in collaboration with other staff members. Capable of clearly presenting findings orally, in writing, or through graphics.\n• Proven analytical skills – Able to compare, contrast, and validate work with keen attention to detail. Skilled in working with “real world” data including scrubbing, transformation, and imputation.\n• Proven problem solving skills – Able to plan work, set clear direction, and coordinate own tasks in a fast-paced multidisciplinary environment. Expert at triaging issues, identifying data anomalies, and debugging software.\n• Design and prototype new application functionality for our products.\n• Change oriented – actively generates process improvements; supports and drives change, and confronts difficult circumstances in creative ways\n• Effective communicator and change agent\n• Ability to prioritize the tasks of the project timeline to achieve the desired results\n• Strong analytic and problem solving skills\n• Ability to cooperatively and effectively work with people from various organization levels\n\nWe are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.3.4University of Maryland Medical System\n3.4Linthicum, MDBaltimore, MD10000+ employees1984Other OrganizationHealth Care Services & HospitalsHealth Care$2 to $5 billion (USD)-1
22Data Scientist$80K-$90K (Glassdoor est.)KnowBe4, Inc. is a high growth information security company. We are the world's largest provider of new-school security awareness training and simulated phishing. KnowBe4 was created to help organizations manage the ongoing problem of social engineering. Tens of thousands of organizations worldwide use KnowBe4's platform to mobilize their end users as a last line of defense and enable them to make better security decisions, every day.\n\nWe are ranked #1 best place to work in technology nationwide by Fortune Magazine and have placed #1 or #2 in The Tampa Bay Top Workplaces Survey for the last four years. We also just had our 27th record-setting quarter in a row!\n\nThe Data Scientist will work closely with the VP of FP&A and the Quantitative Analytics Manager to implement advanced analytical models and other data-driven solutions.\n\nResponsibilities:\nWork with key stakeholders throughout the organization to identify opportunities using financial data to develop business solutions.\nDevelop new and enhance existing data collection procedures to ensure that all data relevant for analyses is captured.\nCleanse, consolidate, and verify the integrity of data used in analyses.\nBuild and validate predictive models to increase customer retention, revenue generation, and other business outcomes.\nDevelop relevant statistical models to assist with profitability forecasting\nCreate the analytics to leverage known, inferred and appended information about origins and recognizing patterns to assist in outlook forecasting\nAssist in the design and data modeling of data warehouse.\nVisualize data, especially in reports and dashboards, to communicate analysis results to stakeholders.\nExtend data collection to unstructured data within the company and external sources\nMine and collect data (both structured and unstructured) to detect patterns, opportunities and insights that drive our organization\nCreate and execute automation and data mining requests utilizing SQL, Access, Excel, SAS and other statistical programs\nTrouble shoot forecast and optimization anomalies with FP&A team through the use of statistical and mathematical optimization models. Develop testing to explain and or reduce these anomalies.\nOversee and develop key metric forecasts as well as provide budget support based on trends in the business/industry.\nMinimum Qualifications:\nMaster's degree in Statistics, Computer Science, Mathematics or other quantitative discipline required\n2-3 years of experience in similar role (Master's Degree)\n0-2 years of experience in similar role (PhD)\nExperience leveraging predictive modeling, big data analytics, exploratory data analysis and machine learning to drive significant business impact\nExperience with statistical computer languages (Python, R etc.) to manipulate and analyze large datasets preferred.\nExperience with data visualization tools like D3.js, matplotlib, etc., preferred\nExceptional understanding of machine learning algorithms such as Random Forest, SVM, k-NN, Naïve Bayes, Gradient Boosting a plus.\nApplied statistical skills including statistical testing, regression, etc.\nExperience with data bases, query languages, and associated data architecture.\nExperience with distributed computing tools (Hive, Spark, etc.) is a plus.\nStrong analytical skills and ability to meet project deadlines.\nNote: An applicant assessment, background check and drug test may be part of your hiring procedure.\n\nNo recruitment agencies, please.4.8KnowBe4\n4.8Clearwater, FLClearwater, FL501 to 1000 employees2010Company - PrivateSecurity ServicesBusiness Services$100 to $500 million (USD)-1
33Data Scientist$56K-$97K (Glassdoor est.)*Organization and Job ID**\nJob ID: 310709\n\nDirectorate: Earth & Biological Sciences\n\nDivision: Biological Sciences\n\nGroup: Exposure Science Team\n*Job Description**\nThe Biological System Science (BSS) Group in the Biological Sciences Division of the Pacific Northwest National Laboratory (PNNL) is seeking a staff scientist with multidisciplinary experience in computational chemistry, cheminformatics, advanced statistics and/or machine learning/deep learning/AI. Preferred candidates will have a broad understanding of the state of computational metabolomics and experience in designing and implementing novel deep learning networks for chemistry applications. Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification is also highly valued. Successful candidates will join a large, uniquely collaborative, collegial group of innovators driving the integration of data science, computational science and analytical chemistry to solve the nations most challenging problems in human health, chemical forensics, and national security. The BSS Group is diverse and inclusive, working closely with colleagues across the laboratory with expertise in computational biology, integrative omics, applied mathematics, computer science, and statistics.\n\n+ Apply knowledge of statistics, machine learning, advanced mathematics, simulation, software development, and data modeling to to design, development and implement methods that integrate, clean and analyze data, recognize patterns, address uncertainty, pose questions, and make discoveries from structured and/or unstructured data.\n\n+ Produce solutions driven by exploratory data analysis from complex and high-dimensional datasets.\n\n+ Design, develop, and evaluate predictive models and advanced algorithms that lead to optimal value extraction from data.\n\n+ Develop and maintain existing deep learning networks that generate novel molecules for drug discovery applications\n\n+ Contribue or author proposals, peer-reviewed papers, and other technical products.\n*Minimum Qualifications**\nBS/BA with 0-1 years of experience or MS/MA with 0-1 years of experience\n*Preferred Qualifications**\n+ MS in chemical engineering, computer science, or related field with a GPA of 3.5+ 5+ years of research experience\n\n+ Intermediate level programming experience (preferably Python) and high-performance computing experience\n\n+ At least one first author published, or proof of submitted, paper applying deep learning for use in novel compound generation\n\n+ Understanding of the NMDA receptor and potential drug targets\n\n+ Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification\n*Equal Employment Opportunity**\nBattelle Memorial Institute (BMI) at Pacific Northwest National Laboratory (PNNL) is an Affirmative Action/Equal Opportunity Employer and supports diversity in the workplace. All employment decisions are made without regard to race, color, religion, sex, national origin, age, disability, veteran status, marital or family status, sexual orientation, gender identity, or genetic information. All BMI staff must be able to demonstrate the legal right to work in the United States. BMI is an E-Verify employer. Learn more at jobs.pnnl.gov.\n*_Please be aware that the Department of Energy (DOE) prohibits DOE employees and contractors from participation in certain foreign government talent recruitment programs. If you are offered a position at PNNL and are currently a participant in a foreign government talent recruitment program you will be required to disclose this information before your first day of employment._**\n_Directorate:_ _Earth & Biological Sciences_\n\n_Job Category:_ _Scientists/Scientific Support_\n\n_Group:_ _Biological Systems Science_\n\n_Opening Date:_ _2020-03-26_\n\n_Closing Date:_ _2020-04-05_3.8PNNL\n3.8Richland, WARichland, WA1001 to 5000 employees1965GovernmentEnergyOil, Gas, Energy & Utilities$500 million to $1 billion (USD)Oak Ridge National Laboratory, National Renewable Energy Lab, Los Alamos National Laboratory
44Data Scientist$86K-$143K (Glassdoor est.)Data Scientist\nAffinity Solutions / Marketing Cloud seeks smart, curious, technically savvy candidates to join our cutting-edge data science team. We hire the best and brightest and give them the opportunity to work on industry-leading technologies.\nThe data sciences team at AFS/Marketing Cloud build models, machine learning algorithms that power all our ad-tech/mar-tech products at scale, develop methodology and tools to precisely and effectively measure market campaign effects, and research in-house and public data sources for consumer spend behavior insights. In this role, you'll have the opportunity to come up with new ideas and solutions that will lead to improvement of our ability to target the right audience, derive insights and provide better measurement methodology for marketing campaigns. You'll access our core data asset and machine learning infrastructure to power your ideas.\nDuties and Responsibilities\n· Support all clients model building needs, including maintaining and improving current modeling/scoring methodology and processes,\n· Provide innovative solutions to customized modeling/scoring/targeting with appropriate ML/statistical tools,\n· Provide analytical/statistical support such as marketing test design, projection, campaign measurement, market insights to clients and stakeholders.\n· Mine large consumer datasets in the cloud environment to support ad hoc business and statistical analysis,\n· Develop and Improve automation capabilities to enable customized delivery of the analytical products to clients,\n· Communicate the methodologies and the results to the management, clients and none technical stakeholders.\nBasic Qualifications\n· Advanced degree in Statistics/Mathematics/Computer Science/Economics or other fields that requires advanced training in data analytics.\n· Being able to apply basic statistical/ML concepts and reasoning to address and solve business problems such as targeting, test design, KPI projection and performance measurement.\n· Entrepreneurial, highly self-motivated, collaborative, keen attention to detail, willingness and capable learn quickly, and ability to effectively prioritize and execute tasks in a high pressure environment.\n· Being flexible to accept different task assignments and able to work on a tight time schedule.\n· Excellent command of one or more programming languages; preferably Python, SAS or R\n· Familiar with one of the database technologies such as PostgreSQL, MySQL, can write basic SQL queries\n· Great communication skills (verbal, written and presentation)\nPreferred Qualifications\n· Experience or exposure to large consumer and/or demographic data sets.\n· Familiarity with data manipulation and cleaning routines and techniques.2.9Affinity Solutions\n2.9New York, NYNew York, NY51 to 200 employees1998Company - PrivateAdvertising & MarketingBusiness ServicesUnknown / Non-ApplicableCommerce Signals, Cardlytics, Yodlee
55Data Scientist$71K-$119K (Glassdoor est.)CyrusOne is seeking a talented Data Scientist who holds a range of data-focused skills both in technical and analytical domains. The ideal candidate is adept at processing, cleansing, and verifying the integrity of data used for visualization and analysis. This role is dynamic, granting the candidate the opportunity to participate in a wide variety of projects and collaborate with many cross-functional teams throughout the business.\n\nDuties and Responsibilities:\nParticipate in an agile scrum cadence\nProcess, cleanse, and verify the integrity of data used for analysis\nPerform functional business requirements analysis and data analysis\nDevelop data models and algorithms to apply to data sets\nAugment data collection procedures to include necessary information for building accurate analytics\nCollaborate with stakeholders throughout the organization to identify opportunities for leveraging data to drive business solutions\nEvaluate the effectiveness and accuracy of data sources and data gathering techniques\nGather critical information from meetings with various stakeholders and produce useful reports\nCoordinate with cross-functional teams to implement models and monitor outcomes\nDevelop automated discrepancy detection systems and distribute reconciliation reports to stakeholders\nRequirements:\nMust be legally authorized to work in the United States for any employer without sponsorship\nProfessional experience using statistical software languages like R, Python, and SQL to query, manipulate, and draw insights from data sets\nStrong problem-solving skills with an emphasis on product development\nExtensive experience with Microsoft SQL, MySQL and MongoDB\nUnderstanding of version control (git) and project management with Azure DevOps\nKnowledge of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.)\nExperience visualizing data for stakeholders using visualization tools such as Power BI\nExperience working with and creating data architectures\nUnderstanding and adherence to agile principles and practices\nAbility to work on problems of any scope where the analysis of situations or data requires a review of a variety of factors\nSelf-maintainability and reliability with minimal supervision\nExcellent interpersonal communication, decision making, presentation, and organizational skills\nAbility to build productive internal/external working relationships\nHarmonious with CyrusOne culture, core values, and business goals\nMinimum Qualifications:\n2+ years of related experience in a data analyst role\nStrong can-do attitude in a time sensitive environment\nOther important information about this position:\nThis position requires typical weekday (Monday - Friday) attendance in an office setting, at times after hours work may be required to meet business and customer needs\nEvery position requires certain physical capabilities. CyrusOne seeks to make reasonable accommodations that enable individuals with disabilities to perform essential duties when possible\nCyrusOne is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status.\n\nCyrusOne provides reasonable accommodation for qualified individuals with disabilities in accordance with the Americans with Disabilities Act (ADA) and any other state or local laws. We will respond to requests for reasonable accommodations to assist you in applying for positions at CyrusOne, or to submit a resume. If you need to request an accommodation, please contact our Human Resources at 214.488.1365 (Option 7) or by email at HR@cyrusone.com.3.4CyrusOne\n3.4Dallas, TXDallas, TX201 to 500 employees2000Company - PublicReal EstateReal Estate$1 to $2 billion (USD)Digital Realty, CoreSite, Equinix
66Data Scientist$54K-$93K (Glassdoor est.)Job Description\n\n**Please only local candidates apply - thank you**\n\nClearOne Advantage is a fast-growing company that is aggressively hiring due to increased business. We are always improving our marketing, culture and technology to provide our employees with the best work atmosphere and our customers with excellent customer service. COA’s proprietary software is tailored to our industry and allows the client to receive the best service possible.\n\nWe are looking for a Data Scientist to analyze large amounts of raw information to find patterns that will help improve our company. We will rely on you to build data products to extract valuable business insights. In this role, you should be highly analytical with a knack for analysis, math and statistics. Critical thinking and problem-solving skills are essential for interpreting data. We want to see a passion for machine-learning and research. Your goal will be to help our company analyze trends to make better decisions.\n\nIf you are looking to work in a team environment, a place where you are more a name than a number, where you interact with leadership daily, then please send your resume for review!\n\nPerks:\nGreat location, right on the water in the Canton Crossing Tower\nCasual work environment and WFH flexibility\nRoom for advancement\nWhat you'll be doing:\nIdentify valuable data sources and automate collection processes\nUndertake preprocessing of structured and unstructured data\nAnalyze large amounts of information to discover trends and patterns\nBuild predictive models and machine-learning algorithms\nCombine models through ensemble modeling\nPresent information using data visualization techniques\nPropose solutions and strategies to business challenges\nCollaborate with engineering and product development teams4.1ClearOne Advantage\n4.1Baltimore, MDBaltimore, MD501 to 1000 employees2008Company - PrivateBanks & Credit UnionsFinanceUnknown / Non-Applicable-1
77Data Scientist$86K-$142K (Glassdoor est.)Advanced Analytics – Lead Data Scientist\nOverview\n\n\nWe are looking for a Data Scientist to join our Data Science team to work on interesting projects to help our clients make data driven solutions. As a Data Scientist, you’ll work closely with the clients to understand their business needs, frame them as statistical problems, and solve them with cutting edge techniques. Collaborate with your team, including machine learning engineers, data engineers, analysts, and TPMs to define tasks, provide estimates, and work together to deliver a world class solution. The ideal candidate will have the balance of technical skills and business acumen to help the client better understand their core needs while understanding technical limitations.\n\nAbout you…\nExperience partnering & communicating with executive management team to understand business needs and pain points\nAbility to communicate data science concepts to business stakeholders\nPassion for the application of machine learning to real world problems\nAdept at developing and iterating solutions rapidly\nAbility to lead development of data science solutions\nWhat we offer our consultants:\n\nExperience working with both large enterprise clients and mid-sized clients\nProgressive responsibilities that encourage ownership and design\nOpportunity to learn and gain experience in complimentary skills such as meeting facilitation, data management, project management, data modeling, and data management\nCompany Culture that celebrates “Foster the culture of we”, “Act with integrity” and “Drive towards excellence” while having fun at work\nTraining and certification opportunities to support your career now and after Logic20/20\nVarious opportunities to give back to the community through company sponsored events\nRequired Qualifications\nExperience building machine learning models using Python\nExperience deploying machine learning models in a production environment\nStrong knowledge of probability statistics\nExperience with Tensorflow or PyTorch\nExperience writing SQL to query databases, structure and modify data\nDemonstrated ability to frame business problems as statistical problems and solve them\nAbility to work both independently and as part of a team\nExperience working in ambiguous and dynamic environments that move quickly\nAn undergraduate degree in mathematics, computer science, or engineering is preferred\nPreferred Qualifications\nPassion and experience driving adoption of machine learning in industry\nExperience deploying machine learning on large scales through Spark or other big data technology\nExperience building systems in AWS\nExperience in computer vision with deep neural networks\nExperience with leading workshops with executives to drive requirements gathering\nMasters or PhD in data science or related field\n\nAbout Logic20/20. . .\n\n\nLogic20/20 is one of Seattle’s fastest growing full-service consulting firms. Our core competency is creating simplicity and efficiency in complex solutions. Although we make it look like magic, we succeed by combining methodical and structured approaches with our substantial experience to design elegant solutions for even the most intricate challenges. Our rapid growth is in response to our ability to deliver consistently for our clients, which is directly related to the quality of the people we hire.\n\nThe past four years, we’ve been in the top 10 “Best Companies to Work For” ….. why? Our team members are highly self-motivated, comfortable conceiving strategies on the fly, and enjoy working both individually and as part of a team. Our environment is very high-energy and demanding, and individuals with remarkable enthusiasm and a can-do attitude are joining our team. We have lots of fun, focus on our employees and our clients, and work to bring our best to every opportunity.3.8Logic20/20\n3.8San Jose, CASeattle, WA201 to 500 employees2005Company - PrivateConsultingBusiness Services$25 to $50 million (USD)-1
88Research Scientist$38K-$84K (Glassdoor est.)SUMMARY\n\nThe Research Scientist I will be tasked with oversight of research in the Division of Cancer Biology Research at the Rochester General Hospital Research Institute.\n\nA strong background in Molecular Biology or Cancer Biology Research is preferred. Mouse models will be used in the research.\n\nSTATUS: Full Time\n\nLOCATION: RGH Research Institute\n\nDEPARTMENT: Cancer Biology\n\nSCHEDULE: Monday-Friday; Days\n\nATTRIBUTES\nMD or PhD who is not self supporting of their own salary nor has their own research program\nFunctions with minimal direction from Research Scientist II, Senior Research Scientist or Laboratory Director.\nStrong analytical, computer, leadership and problem-solving skills\nRESPONSIBILITIES\nConducts research projects including complex experiments, some in parallel, utilizing current concepts and recognized standard techniques, developing new protocols as necessary\nDemonstrates a high level of initiative in performing experiments, analyzing data and drawing conclusions regarding progress and results of work.\nMaintains a familiarity with current and emerging technologies through reading and understanding scientific and technical literature resulting in a broadening understanding of disciplines outside area of training and enabling the use of new and improved procedures in the laboratory.\nDuties are performed with an understanding of drug discovery in area of specialization.\nEDUCATION PhD; MD Rochester Regional Health is an Equal Opportunity / Affirmative Action Employer. Minority/Female/Disability/Veteran3.3Rochester Regional Health\n3.3Rochester, NYRochester, NY10000+ employees2014HospitalHealth Care Services & HospitalsHealth Care$500 million to $1 billion (USD)-1
99Data Scientist$120K-$160K (Glassdoor est.)isn’t your usual company. Our work is powered by the premise that every person at is unique, possessing a distinct set of skills, personality, and passions. We embrace our collective talents to tackle technical challenges, refine our successfully disruptive business ideas, and co-create one of the most human and inspiring work cultures out there. We are a team of collaborators, valuing and rewarding shared success over individual heroics.\nAs a member of our Data Science team, you will use your quantitative expertise to identify new areas of research and optimization, and then see those ideas through to production. Data Science is a fundamental contributor to Intent’s success - your work will have a direct and tangible impact on the business.\nThere are no typical projects, but a workflow might involve performing research and analysis against petabytes of historical data using our collection of large-scale analytics tools like Spark, Snowflake, and RedShift, building prototypes using mostly Scala or another functional language, pairing with engineers on the Modeling and Prediction team to harden and deploy the functionality, and running live tests to monitor the results.\nAll of these steps take place in an environment of respect and collaboration, and the Data Science team is empowered to own its agile processes. Every member of the team is expected to be both a student and teacher, and we believe that the most effective Data Science team is one that is collectively learning and growing. Experience in coaching and mentoring colleagues at all levels is strongly desired. As part of the Data Science team, you’d help build out a real-time predictive analytics platform that makes decisions for some of the largest sites on the web.\nAbout You:\nSignificant industry experience in several of the following areas: personalized experiences, big data analytics, implementing machine learning & statistical methods, designing and running A/B tests, product design and life cycle, writing production code, designing online auctions.\nExperience in user experience customization a plus\nExperience coaching and mentoring team members\nExperience writing production software in languages like Scala, Clojure, Java, Python, or C++ in an agile, collaborative environment\nExperience with handling large amounts of data (TB+) in a production setting\nExperience with Spark is a significant plus\nExperience in ad-tech a plus\nAbout Us:\nis the data science company for the world’s leading online commerce and travel brands. Our Predictive Intelligence Platform uses patented technology to predict user behavior in real-time and identify the future value of every user. Over 450 innovative brands from more than 40 countries trust Intent’s real-time predictions to deliver personalized user experiences that maximize utility and ROI.\nOur team is over 100 people and our offices span globally. We’re headquartered in NYC with locations in London, Kuala Lumpur, and Sao Paulo.\nEvery day, we’re inspired by two pursuits. First, we’re building novel products that are upending e-commerce. Second, we’re building the company we’ve always wanted to work for — one that’s open, human and collaborative, where very smart people come together to share ideas and get things done. We’re included on Built in NYC's Best Places to Work list and have been on Crain’s 100 Best Places to Work in NYC list for seven years running.\nLove Your Job!\nOur employees enjoy coming to work, and we let them know they're valued.\nOur vibrant team accomplishes a lot every day, but we insist upon work/life balance so things never become stale. We don’t take ourselves too seriously, but we take our work very seriously.\nWe believe that in order for our employees to perform their best, they need access to strategic decisions, and so our flat structure and open communication invite innovation from all levels — ideas flow freely.\nWe offer competitive compensation, stock options, and great perks & benefits, including:\nUnlimited vacation\nA generous parental leave policy\nA beautiful, dog-friendly office in SoHo with drinks and snacks\nAn open environment with lots of natural light and roof deck access\nAnnual $2,000 learning budget and Citi Bike membership\nAccess to Fond, our employee perks program featuring deals and discounts on hundreds of products and services\nAccess to Sherpaa, a telehealth service with 24/7\nIn-office yoga classes\nCompany-wide social events, and more!\nSo what are you waiting for? Apply with your resume in just a few clicks!\nAbout Us\nOur Products\nOur Dogs\nTwitter\nInstagram4.6<intent>\n4.6New York, NYNew York, NY51 to 200 employees2009Company - PrivateInternetInformation Technology$100 to $500 million (USD)Clicktripz, SmarterTravel
Unnamed: 0Job TitleSalary EstimateJob DescriptionRatingCompany NameLocationHeadquartersSizeFoundedType of ownershipIndustrySectorRevenueCompetitors
946946Senior Data Analyst$99K-$178K (Glassdoor est.)Senior Data Analyst\n\nAbout us\n\n\nLife360 brings families closer with smart tools designed to protect and connect the people who matter most.\n\nKnown for first-to-market solutions for modern family challenges, Life360 recently reached #1 in Apple's US App Store's list of free social networking apps. Nearly 1 in 10 US families with kids use Life360 an average of 12 times a day, and global membership is growing exponentially, with over 25 million monthly active users in over 140 countries making Life360 the largest mobile service for families in the world.\n\nThis reach gives us the opportunity to do unprecedented good for families through our valued core offerings: advanced location sharing, private messaging, driver monitoring, help alerts, 24/7 roadside assistance, and Crash Detection with emergency response. On average we respond to 1,000 roadside assists and dispatch 200+ ambulances each month to those in need.\n\nOffering both free and paid memberships. In addition, the company has raised over $200 million in equity financing, and recently completed an IPO on the ASX exchange giving our employees the liquidity of a public company with the upside of a private growth stage business.\n\nLife360's rapidly growing team of 150+ employees is headquartered in San Francisco, with offices in San Diego, and Las Vegas.\n\nAbout the Job\n\n\nData plays a crucial role in Life360's growth by driving smarter decisions, improving operations, and creating higher value user experiences. As an analytics team, we partner with a wide variety of cross-functional partners to apply data insights against strategic initiatives. "Know Our Users" is a Life360 core value and we're looking for analytics professionals who are passionate about leveraging user data to create value for Life360 families.\n\nYou'll be working in a dynamic growth environment, leading efforts to better understand the business, the product, and the customer. Life360 has one of the most interesting datasets in the world: location, driving, product usage, and purchasing behavioral data - all centered around who matters most, the family. If you have a passion for making an impact and working on products that help millions of families around the world, then this is the right place for you.\n\nResponsibilities\n\n\nAnalytics team members work closely with specific strategic teams but also have opportunities to work on company-wide initiatives. This person is expected to focus on that particular area but also generalize their skills towards other parts of the business with a variety of projects.\n\nIn this role, we are looking for someone to partner with the Revenue team in developing actionable insights from both product and financial perspectives. Common projects range from financial disclosure reports that tell Life360's growth story to conducting deep-dive analyses into identifying opportunities for subscription growth. Ultimately, you will be tasked with finding data insights that deliver business value.\n\nThese are some typical responsibilities:\nLeverage data to understand the Life360 family and their product usage, developing insights that apply to product, marketing, and business strategy.\nPartner with executives, product managers, engineers, marketers, designers to translate data insights into smarter decisions and applications.\nEstablish and manage KPIs that measure the health of the business, product performance, and customer experience quality.\nBuild dashboards and reporting processes to monitor business and product trends.\nDevelop frameworks, tools, and best practices to apply data insights towards business questions.\nConduct analyses and build models that identify opportunities and drive growth.\nDesign and analyze experiments, communicate results, and drive decisions.\nPotential projects may include forecasting business performance, developing family driving profiles, and predicting customer lifetime value.\nQualifications\n\n\nWe are looking for candidates with a diverse background that will compliment the skills and backgrounds of the current team. If you don't fit all the criteria below please apply anyway as this list is more of a preference rather than a rule. Our priority is for a well rounded team that delivers results.\nWe are looking for candidates who have had previous experience on analytics teams and are willing to help coach and mentor colleagues on best data practices. 5+ years is preferred.\nDegree in a quantitative field like statistics, economics, applied math, operations research, or engineering, finance, business intelligence. Advanced degrees are preferred.\nSQL expertise - able to write structured and efficient queries on large datasets.\nExperience in scripting languages, like analysis and visualization libraries in Python or R.\nStrong verbal/written communication skills and the ability to collaborate with cross-functional partners to build the business.\nProficiency in building data visualizations and interactive dashboards with tools like Tableau.\nExperience designing and evaluating experiments to draw inferential recommendations.\nCuriosity to learn about new topics and uncover hidden insights.\nPerks\nFridays are Work From Home days at Life360\nCompetitive pay and benefits\nFree snacks, drinks (three ways to brew your favorite cup of coffee), and food in the office\nCatered lunches throughout the week\nHealth, dental and vision insurance plans\n401k plan\n$200/month Quality of Life perk\nA great office with plenty of light in the heart of the SOMA district in beautiful San Francisco\nWhatever makes you stronger makes us stronger. We buy you the things you need to improve yourself and get your job done.\nThis position is located in San Francisco, CA. It is not a remote role.3.9Life360\n3.9San Francisco, CASan Francisco, CA51 to 200 employees2008Company - PublicComputer Hardware & SoftwareInformation TechnologyUnknown / Non-Applicable-1
947947Data Science Project Manager$37K-$100K (Glassdoor est.)At MassMutual, we are passionate about helping millions of people find financial freedom and this passion has driven our approach to developing meaningful experiences for our customers. The Data Science team, part of the Enterprise Technology and Experience organization, is comprised of highly skilled and collaborative problem solvers who are motivated to create innovative solutions that exceed the changing needs of our customers and move MassMutual and the industry forward.\n\nTo continue our cutting-edge work, we are hiring a Data Science Project Manager to join our team.\n\nWhat great looks like for this role\n\nA seasoned Project Manager will have the opportunity to apply advanced project and program management knowledge, skills, tools and techniques to project deliverables, processes, communications and presentations in order to meet or exceed stakeholder needs and expectations. The Project Manager will have the ability to think strategically to understand, apply, promote and contribute to MassMutual's delivery methodologies, standards and tools. This individual will work with a team that embraces diversity in all of its forms, respects others and looks to have fun.\n\nObjectives of this role\nTo scale our data science impact.\nTo impact complex business goals through the delivery of quality work timely.\nTo ensure documentation is in place and process is followed meeting standards\nDaily and Monthly Responsibilities - What You Will Do:\nLead broad scope projects that have medium to long-term focus\nEngage with all levels across the enterprise\nServe as a conduit of knowledge between functional and technical teams\nCommunicate regularly with individuals both within and outside of our team, managing relationships and expectations\nNavigate ambiguity to deliver results\nDevelop plans for continuous service to support implementation of products\nAct as a champion for data science capabilities by communicating their benefits and how they can be implemented\nProvide consultation, business analysis, project management, and leadership on multiple projects of varying duration, size, and complexity\nMotivate teams to work together, communicate, and deliver\nElicit, translate and simplify requirements\nDocument and organize acceptance criteria for user stories\nManage budget, timeline, and scope throughout the course of all assigned projects\nLead project teams during all phases of the development life cycle including requirements gathering and analysis, design, build, pilot, implementation and continuous service\nFacilitate client and project team interactions including: scrums, sprint planning, sprint retrospectives, sprint reviews, incident management and release management\nWork with product managers to define improvements to business processes, assist decision-makers in gathering information to make decisions, and help quality assurance test solutions\nWork with technical leads, product managers to plan, develop technical scopes of work and manage the execution of projects/product changes in response to requirements from our stakeholders\nBe self-supportive in collaborating with peers to effectively deliver a robust solution for the business\nDrive process within a matrix management setting\nWhat You Will Not Do:\nDesign strategic roadmaps\nLarge amounts of computer programming\nManipulation of large data sets\nSit in solitude at your desk\nBasic Qualifications\nBachelors Degree preferably in Business/Finance or an analytical field such as Economics, Mathematics, Engineering, Computer Science\n4+ years managing and driving the execution of complex projects\nExperience in/working in partnership with a technical role, such as an engineer, developer, data scientist, etc. a plus\nProficient with project management tools and techniques, such as JIRA, Confluence, Scrum and Kanban\nExcellent interpersonal communication, conflict management, coordination, and planning skills with cross-functional teams\nSkilled in applying judgment to balance process compliance with achievement of business objectives\nProject leadership experience focused on engaging others in the delivery and execution of technical solutions and service deliverables\nAbility to assess a project's scope and the team's ability to execute\nOutcome oriented with the ability to drill down from the big picture to process details\nAbility to communicate objectives, plans, status and results clearly\nStrong leadership skills and influencer\nAbility to collaborate across diverse teams and organizations\nStrong organizational skills and detail oriented\nAuthorized to work in the United States without requiring visa sponsorship now or in the future\nPreferred Qualifications\nMasters Degree, preferably in Business/Finance or an analytical field such as Economics, Mathematics, Engineering, Computer Science\nAgile certification or experience\nSolid grasp of software technologies and stacks.\nFormer technical experience is preferred, such as working with data science teams or experience developing and/or deploying predictive models3.6MassMutual\n3.6Boston, MASpringfield, MA5001 to 10000 employees1851Company - PrivateInsurance CarriersInsurance$10+ billion (USD)-1
948948Data Engineer$62K-$113K (Glassdoor est.)Do you find data architecture exciting? Does building a new data pipeline or optimizing a data warehouse make you happy? Can you migrate a data store to the cloud, run a few NLP algorithms to clean things up, and build a set of processes to keep the data current? Are you comfortable with Terabyte-scale data, optimizing cloud stores, building workflow management systems, AWS, and Python scripting? Can you work closely with business stakeholders to understand their needs and sate those through data solutions? If so, we want you!\n\nFivestars is seeking a Senior Data Engineer. Reporting to the Director of Analytics and Data Science, you will work with the Product, Marketing, and Engineering teams at Fivestars to build and maintain world-class data infrastructure.\n\nAt Fivestars, our mission is to help businesses and communities thrive by turning every transaction into a relationship. Over 50 million people use Fivestars to get rewarded at more than 14,000 local businesses with one rewards program. Local businesses use Fivestars to bring more customers into their stores with an all-in-one marketing and payments program. Fivestars drives over $3 billion in local commerce across its network per year.\n\nFivestars was launched out of Y-Combinator in 2011 (most recently on Y-Combinator's Top 75 Companies List for 2019) and has raised over $105 million from notable investors including Lightspeed, DCM, HarbourVest, Menlo Ventures, Y-Combinator, and others. Together, let's love local!\n\nResponsibilities\nBuild and maintain data infrastructure (Redshift/Presto/Kinesis/Glue/EC2/S3/etc.)\nCreate data pipelines to/from external partners using Python and other tools\nUse NLP to clean and consolidate data\nEstablish and use workflow-management tools to orchestrate solutions\nMonitor and improve pipeline and data-warehouse performance\nSkills\nSQL – write sophisticated and optimized queries against large databases\nPython – create efficient and scalable pipelines and solutions\nBusiness Acumen – understand the questions we are trying to answer through data\nProblem Solving – apply structured methods to analyze problems and develop solutions\nCommunication – explain technical concepts clearly and concisely\nRelationships – influence adoption of infrastructure through partnership\nQualifications/Experience\nUndergraduate degree in a highly technical field (e.g. Computer Science, Electrical Engineering, etc.) from a top-tier university\nGraduate degree (MS, PhD, etc.) in a similar field will be highly valued but is not required\n1+ years of experience in a data-engineering function using cloud-based infrastructure\nAbility to solve technical problems and create efficient, robust, and scalable solutions\nDemonstrated intellectual curiosity\nPerks\nPre-IPO stock options\nExcellent medical, dental, and vision coverage\nGreat downtown-SF office location\n4 weeks PTO + 11 paid-holidays per year\nThree in-office lunches per week and a fully-stocked kitchen with fruit, (healthy) snacks, coffee, and drinks\nTeam happy hours and company-sponsored events\nWellness Benefit - $500 per year to spend on eligible physical or mental well being\nFSA; short-/long-term disability coverage; life Insurance; 401K; EAP; and commuter benefits\nFivestars provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, Fivestars complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.3.9Fivestars\n3.9San Francisco, CASan Francisco, CA201 to 500 employees2011Company - PrivateInternetInformation Technology$100 to $500 million (USD)Belly, SpotOn
949949Principal, Data Science - Advanced Analytics$86K-$137K (Glassdoor est.)IQVIA is the leading human data science company focused on helping healthcare clients find unparalleled insights and better solutions for patients. Formed through the merger of IMS Health and Quintiles, IQVIA offers a broad range of solutions that harness the power of healthcare data, domain expertise, transformative technology, and advanced analytics to drive healthcare forward.\n\nJob Description\n\nThe IQVIA Advanced Analytics team is one of the leading healthcare analytical teams in the world. Joining the AA team provides the opportunity to work with extremely complex data and methodologies in a fast-paced, ever-changing environment. We seek highly motivated people who truly want to make a difference in the life sciences industry. At IQVIA, we look for the very best people, and then give them meaningful work to do. we dont simply think about careers, we think about contributions.\n\nAdvanced Analytics - with departments in Philadelphia, Frankfurt, Paris, and Warsaw as well as a network of over 150 team members worldwide - is the global competence center for data science at IQVIA. Complex advanced analysis at the highest level are conceptualized and implemented to support international customers in the pharmaceutical industry - often within multinational projects. As a member of our team you can expect exciting international projects with interesting development perspectives.\n\nThe position will use large data sets to find opportunities for product and process optimization and models to test the effectiveness of different courses of action. Our data scientists have strong experience using a variety of data mining/data analysis methods, building and implementing models, using/creating algorithms and simulations. For this position, we are seeking several years of direct experience with developing algorithms and models to solve prediction problems. Awareness of various techniques available to use in predictive analytics. Using their proven ability to drive business results with their data-based insights, they will comfortably interact and work with a wide range of stakeholders and functional teams. They have a passion for discovering solutions hidden in large data sets and working with stakeholders to improve business outcomes.\n\nWhat were looking for:\nQuantitative background with advanced degrees (Master, PhD preferred) in Statistics, computer science, engineering, informatics, data science, or related field.\nIn-depth understanding of machine learning algorithms and statistical models\nAbility to manage, lead and communicate\nExperience in pharmaceutical or hospital/healthcare industry\nWhat youll be doing:\nBuild machine learning/statistical models and pipelines for solving predictive analytic tasks with electronic healthcare claims and medical records\nApply machine learning, data mining technologies in developing innovative solutions in pharmaceutical industry.\nParticipate at client meetings for complex proposals to present IQVIA advanced analytic methodologies to clients and to bring credibility for IQVIA team\nEnsure data quality throughout all stages of acquisition and processing, including such areas as data collection, normalization, transformation, embedding, visualization, etc.\nPresent study findings to clients and translate analytic outputs to business impact and recommend actions to clients to improve their business performance\nEnsure data quality throughout all stages of acquisition and processing, including such areas as data collection, normalization, transformation, embedding, visualization, etc.\nWork with IQVIA technology team to support machine-learning algorithms in big data platform to solve a variety of business problems.\nIQVIA is an EEO Employer - Minorities/Females/Protected Veterans/Disabled\n\nWe know that meaningful results require not only the right approach but also the right people. Regardless of your role, we invite you to reimagine healthcare with us. You will have the opportunity to play an important part in helping our clients drive healthcare forward and ultimately improve human health outcomes.\n\nWhatever your career goals, we are here to ensure you get there!\n\nWe invite you to join IQVIA.\n\nJoin Us\n\nMaking a positive impact on human health takes insight, curiosity, and intellectual courage. It takes brave minds, pushing the boundaries to transform healthcare. Regardless of your role, you will have the opportunity to play an important part in helping our clients drive healthcare forward and ultimately improve outcomes for patients.\n\nForge a career with greater purpose, make an impact, and never stop learning.\n\nIQVIA is an EEO Employer - Minorities/Females/Protected Veterans/Disabled\n\nIQVIA, Inc. provides reasonable accommodations for applicants with disabilities. Applicants who require reasonable accommodation to submit an application for employment or otherwise participate in the application process should contact IQVIAs Talent Acquisition team at workday_recruiting@iqvia.com to arrange for such an accommodation.3.6IQVIA\n3.6Plymouth Meeting, PADurham, NC10000+ employees2017Company - PublicBiotech & PharmaceuticalsBiotech & Pharmaceuticals$2 to $5 billion (USD)PPD, INC Research, PRA Health Sciences
950950Sr Scientist, Immuno-Oncology - Oncology$58K-$111K (Glassdoor est.)Site Name: USA - Massachusetts - Cambridge\nPosted Date: Mar 24 2020\n\nAre you energized by a challenging role in immuno-oncology, where scientific demand is driving team growth? If so, this Senior Scientist would be a great opportunity to consider.\n\nThe Immune Biology Group within GSKs Immuno-Oncology & Combinations Research Unit (IOC RU) is seeking a Sr. Scientist with experience in immuno-oncology or immunology to join our team.\n\nIn this role, you will be responsible for conducting research designed to identify and validate immune-based therapies for cancer.\n\nThis Sr. Scientist role will provide you the opportunity to lead key activities to progress your career. Responsibilities include:\nDeliver critical path biology results to support GSKs pipeline of cancer immunotherapies from early discovery to first-time-in-human commitment.\nEstablish and expand internal wet lab capabilities at a growing GSK site.\nActively participate in building and maintaining drug discovery relationships with both internal stakeholders and external partners.\nWork within a dynamic and collaborative environment to deliver high-quality scientific data packages to meet experimental and organizational goals.\nWhy you?\nBasic Qualifications:\n\n\nWe are looking for professionals with these required skills to achieve our goals:\nBachelors or Masters degree in immunology, immuno-oncology or related field with 5+/3+ years of experience, respectively.\nStrong scientific background in immunology or immuno-oncology research, with a focus on bioassay development to functionally characterize biologics and/or small molecules.\nResearch expertise in the field of adaptive immunity with a focus on T cell biology with demonstrated ability to independently establish robust in vitro and ex vivo functional assay protocols to investigate mechanisms of action for multiple drug candidates and their combinations.\nExpertise in high-dimensional flow cytometry to phenotypically characterize immune cells from human and murine tissue samples, including both surface and intracellular staining.\nDemonstrated hands-on ability to independently design, conduct, and analyze pharmacology studies.\nStrong communication skills and ability to conduct research in a cross-functional team environment.\nAbility to interpret data clearly and concisely both verbally and in documents and present results in an organized manner.\nAbility to prioritize, manage time efficiently, and implement creative solutions to meet program needs.\nCommitment to continual improvement by reading and applying the latest scientific literature, methodologies and technology where appropriate.\nA high level of integrity and desire to develop transformational medicines that bring benefit to patients\nPreferred Qualifications:\n\n\nIf you have the following characteristics, it would be a plus:\n2+ years pharmaceutical or biotechnology industry research experience working in matrixed drug discovery project teams.\nResearch expertise with functional characterization of myeloid cells\nExperience liaising with Laboratory Operations personnel.\nWhy GSK?\n\nOur values and expectations are at the heart of everything we do and form an important part of our culture. These include Patient focus, Transparency, Respect, Integrity along with Courage, Accountability, Development, and Teamwork. As GSK focuses on our values and expectations and a culture of innovation, performance, and trust, the successful candidate will demonstrate the following capabilities:\nOperating at pace and agile decision-making using evidence and applying judgement to balance pace, rigour and risk.\nCommitted to delivering high quality results, overcoming challenges, focusing on what matters, execution.\nContinuously looking for opportunities to learn, build skills and share learning.\nSustaining energy and well-being\nBuilding strong relationships and collaboration, honest and open conversations.\nBudgeting and cost-consciousness\n*LI-GSK\n\n*This is a job description to aide in the job posting, but does not include all job evaluation\n\nIf you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).\n\nGSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.\n\nImportant notice to Employment businesses/ Agencies\n\nGSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.\n\nPlease note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSKs compliance to all federal and state US Transparency requirements. For more information, please visit GSKs Transparency Reporting For the Record site.3.9GSK\n3.9Cambridge, MABrentford, United Kingdom10000+ employees1830Company - PublicBiotech & PharmaceuticalsBiotech & Pharmaceuticals$10+ billion (USD)Pfizer, AstraZeneca, Merck
951951Senior Data Engineer$72K-$133K (Glassdoor est.)THE CHALLENGE\nEventbrite has a world-class data repository of live events, powering millions of events and hundreds of millions of ticket transactions each year in 170+ countries. Our platform allows event creators and event goers to have the most meaningful live experiences. As a Senior Data Engineer, you will be part of a team that is building our next-gen big data infrastructure to support both internal and customer-facing applications.\nTHE TEAM\nWe're a people-focused Engineering organization: our people value working together in small teams to solve significant problems, supporting an active culture of mentorship and inclusion, and pushing themselves to learn new things daily. Pair programming, weekly demos, tech talks, and quarterly hackathons are at the core of how we’ve built our team and product. We believe in engaging with the community, regularly hosting free events with some of the top technical speakers, and actively contributing to open source software (check out Britecharts as an example!). Our technology spans the web, mobile, API, Big Data, machine learning, search, physical point of sale, scanning systems, and the data infrastructure required to support those systems.\nTHE ROLE\nWe are hiring a Senior Data Engineer to help us build a scalable, reliable, secure, and highly performant data platform. You'll help reinforce and extend the infrastructure that powers the use of data at Eventbrite. From infrastructure development to data analysis to ETL jobs, you will need a broad range of big data engineering skills. The team has strong and versatile engineers. You will grow. We hope to grow with you.\nTHE SKILL SET\n8-10 years of experience building high quality software in Python, Java, or Scala\n5+ years of experience designing batch, streaming, and event-driven Data Warehouse and ETL architectures with Hadoop ecosystem, such as Spark, Hive, Storm, Presto, Kafka, Hbase, MySQL databases, and HDFS\nUnderstanding of Data Engineering, Data Science, Machine Learning, Data Analytics, and the relevant technologies that support them\nDeep expertise in cloud computing, preferably AWS, security, cluster sizing, and performance tuning. Ability to setup process and systems to monitor and reduce cloud computing costs for a large organization\nExperience building systems to instrument, collect and process billions of events, such as clickstream data. Deep understanding of measuring and ensuring data quality at scale\nOutstanding verbal, written, presentation, and facilitation skills. In particular, a demonstrated ability to effectively communicate technical and business issues and solutions to multiple organizational levels\nAbility to teach and mentor engineers with a variety of skill levels and backgrounds\nVision to define the future of how Big Data and Analytics intersect at Eventbrite. The Analytics community at Eventbrite will rely on you to build and maintain a data environment built for speed, accuracy, consistency and uptime\nSkills to support analytics by building a world-class data warehousing environment that empowers analysts to deliver insights to their stakeholders. Evaluate competing data technologies and tool­sets from various vendors and open-source products; drive platform selection; lead technical architecture, application design and implementation\nSkills to support analytics by building a world class data warehousing environment that empowers analysts to deliver insights to their stakeholders\nEvaluate competing data technologies and toolsets from various vendors and open-source products; drive platform selection; lead technical architecture, application design and implementation\nCombine strong analytical skills with the ability to collect, organize and analyze large amounts of information with attention to detail and accuracy\nPassionate about live entertainment, and eager to help build Eventbrite into the world's leading event technology platform\nStrong analytical and problem-solving skills and attention to detail\n\nBONUS POINTS\nFamiliarity with a server-side frameworks, such as Django, Express, Rails, or .Net\nSkilled in various forms of data modeling including ER, XML Schemas, SQL, logical and physical database design, dimensional modeling, and/or OLAP cubes\nKnowledge of database schemas and models, including 3NF, star schemas, cubes, etc. and in developing physical database schemas from logical models\nStrong knowledge of database optimization and scaling approaches including indexing, partitioning, sharding, clustering, in ­memory tables, horizontal and vertical scaling\nFamiliarity with managing large datasets and understanding the complexities of merging large databases, meeting security audit requirements, and implementing a data retention policies\n\nABOUT EVENTBRITE\nEventbrite is a global ticketing and event technology platform, powering millions of live experiences each year. We empower creators of events of all shapes and sizes – from music festivals, experiential yoga, political rallies to gaming competitions –– by providing them the tools and resources they need to seamlessly plan, promote, and produce live experiences around the world. Last year, the team served 795,000 creators hosting nearly 4 million experiences across 170 countries. Meet some of the Britelings that make it happen.\n\nIS THIS ROLE NOT AN EXACT FIT?\nSign up to keep in touch and we’ll let you know when we have new positions on our team.\n\n\nEventbrite is a proud equal opportunity/affirmative action employer supporting workforce diversity. We do not discriminate based upon race, ethnicity, ancestry, citizenship status, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), marital status, registered domestic partner status, caregiver status, sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, genetic information, military or veteran status, mental or physical disability, political affiliation, status as a victim of domestic violence, assault or stalking, or other applicable legally protected characteristics.\nApplicant Privacy Notice4.4Eventbrite\n4.4Nashville, TNSan Francisco, CA1001 to 5000 employees2006Company - PublicInternetInformation Technology$100 to $500 million (USD)See Tickets, TicketWeb, Vendini
952952Project Scientist - Auton Lab, Robotics Institute$56K-$91K (Glassdoor est.)The Auton Lab at Carnegie Mellon University is a large academic group driven by a desire to make a real-world difference in a broad range of research interests. The areas of our current focus include, but are not limited to, modeling complex temporal and sequential data, structural learning, incorporating diverse feedback, interactive network science and human-machine interaction. We are always interested in finding ways to make Artificial Intelligence more accessible, beneficial and affordable to everyone. The areas of our current application interests include healthcare in clinical, managerial, and new sensing modalities contexts, radiation safety, countering human trafficking, agriculture, predictive maintenance of equipment, multi-modal data analytics, etc.\n\nWe are seeking a Project Scientist to join us in the Auton Lab. In this role, you will act as a team leader for specific areas of research projects in applied data science. Working with principal investigator(s), you will prioritize project goals based on overall organizational goals. You will contribute significantly in the development and documentation of research finding and as a major collaborator of scientific papers. There will be frequent opportunities to present research finding to current or potential sponsors and at major national and international conferences.\n\nCore responsibilities will include:\nPreparing data, developing models, and producing research findings\nContributing to project management and maintenance of customer relationships\nDocumenting research findings, producing reports and synthetic summaries\nContributing to scientific publications\nWorking with principal investigator(s) to formulate research goals and plans\nPreparing and delivering presentation of research findings\nQualifications:\nPhD in machine learning, applied mathematics, statistics, computer science, or other relevant field or equivalent combination of training and experience preferred\n10-15 years of Research Experience required\nProven technical background\nExperience in analyzing of data at scale, proven hands-on model development\nFlexibility, excellence, and passion are vital qualities within Auton Lab. Inclusion, collaboration and cultural sensitivity are valued proficiencies at CMU. Therefore, we are in search of a team member who is able to effectively interact with a varied population of internal and external partners at a high level of integrity. We are especially interested in qualified candidates who can contribute through their work/life experiences to the diversity and excellence of the academic community.\n\nYou should demonstrate:\nExcellent communication skills\nAbility to work optimally in a team\nAre you interested in this opportunity with us? Please apply.\n\nMore Information:\n\nPlease visit “Why Carnegie Mellon” to learn more about becoming part of an institution inspiring innovations that change the world.\n\nA listing of employee benefits is available at: www.cmu.edu/jobs/benefits-at-a-glance/.\n\nCarnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran.2.6Software Engineering Institute\n2.6Pittsburgh, PAPittsburgh, PA501 to 1000 employees1984College / UniversityColleges & UniversitiesEducationUnknown / Non-Applicable-1
953953Data Science Manager$95K-$160K (Glassdoor est.)Data Science ManagerResponsibilities:\n\nOversee a team of Data Scientists and Data Visualization Analysts who transform enterprise data into value drive insights\n\nDesign and implement processes for complex large-scale datasets for data mining, predictive modeling, and research purposes\n\nServe as an advisor for business stakeholders identifying data needs and explaining the importance and use of data applicable to their usage\n\nOversee development of a style guide detailing best practices standards for data visualization\n\nManage the intake process of analytics projects, measure value, and prioritize projects\n\nAlign the department as a customer-oriented service providing insights and information\n\nCoach and mentor team providing specific, timely and constructive feedback\n\nProvide day-to-day leadership and operational management in area of responsibility\n\nExecute objective, plans, and policies in line with enterprise level strategy\n\nProactively find new opportunities to leverage technology for continuous improvement and greater efficiency\n\nContribute to budget development and assist in preparation of operational plans for department\n\nOversee area of responsibility to adhere to approved budgets\n\nMS degree in a quantitative discipline plus a minimum of 5 years of professional work experience\n\nMinimum of 3 years of management experience\n\nProfessional work experience with R and advanced statistical modeling techniques including machine learning techniques\n\nExcellent oral and written communication skills\n\nExcitement, curiosity and passion for shaping the future through digital technology\n\nUS Citizenship or green card required3.2Numeric, LLC\n3.2Allentown, PAChadds Ford, PA1 to 50 employees-1Company - PrivateStaffing & OutsourcingBusiness Services$5 to $10 million (USD)-1
954954Data Engineer-1Loading...\n\nTitle: Data Engineer\n\nLocation: Austin, TX\n\nType: Contract\n\nJob #: WD50842311\n\nNote: This job is not open to C2C or 3rd party candidates.\n\nIGNW is an engineering-based resourcing company headquartered in Portland, OR with GEO based teams in Seattle, WA / Austin & Dallas, TX / Southeast US / California. We have global partnerships to deliver the industry's top technical solutions and talent to every one of our clients. Some of our strategic partnerships include Google Cloud, Hashicorp, Puppet, Cisco, and Docker.\n\nWHY IGNW?\n\nSo many reasons, one being IGNW being voted Glassdoor’s Best Places to Work in 2020 across the U.S. and eight other countries. See the details here https://www.thrillist.com/news/nation/glassdoor-best-places-to-work-2020. IGNW also earned the Best of Staffing Award for providing remarkable service to job seekers, hiring managers and current contractors. Check it out! https://www.bestofstaffing.com/agency/ignw/\n\nWe are proud to foster a great team working environment and offer highly competitive compensation and full benefits packages including medical, flexible spending accounts, dental, vision, 401k and more\n\nTake a look and see if this is a match…. or any other jobs posted!\n\nProject Scope:\n\nOur client is looking for a Data Engineer to manage databases and data sets, and performs data transformations and analyses.\n\nResponsibilities:\nCreate and maintain data pipeline architectures.\nAssemble large, complex data sets for project teams and data scientists.\nBuild the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, NoSQL, and ‘big data’ technologies.\nDesign and implement automated solutions for common data transfer or transformation tasks.\nBuild analytics tools that utilize the data pipeline to provide actionable insights.\nWork with stakeholders to assist with data ­related technical issues and support their data infrastructure needs.\nKeep our data separated and secure across multiple data centers and cloud services.\nCreate data tools for data scientists that assist them in building and optimizing their models.\nDocument and version schema changes across all database stacks.\nRequirements:\n5+ years in a Data Engineering or similar position.\nPrevious system administration with linux and windows.\nMust have strong SQL skills.\nExtensive experience in managing large scale databases such as Postgres, MS SQL Server, or Oracle.\nExtensive experience with NoSQL databases such as HBase, Cassandra, or Dynamo.\nFluency in one or more scripting languages for data task automation such as Unix shell, Python, Ruby, TCL, or Javascript.\nExperience in data security and management of PII.\nPlease note: Candidates may need to pass a drug and/or background check.\n\nPlease send in your resume today and be sure to get a quick response from one of our onsite recruiters: resumes@ignw.io\n\nCheck out our reviews here: glassdoor.com\n\nTo view other IGNW opportunities please visit https://www.ignw.io/jobs\n\n#IGNW4.8IGNW\n4.8Austin, TXPortland, OR201 to 500 employees2015Company - PrivateIT ServicesInformation Technology$25 to $50 million (USD)Slalom
955955Research Scientist – Security and Privacy$61K-$126K (Glassdoor est.)Returning Candidate? Log back in to the Career Portal and click on 'Job Browsing/History' and find the job you're looking for.\n\n2019-024-OIC: Research Scientist – Security and Privacy\n\nDirectorate Open Innovation Center\nLocation Beavercreek, OH\nIf you want help develop the future technology to ensure security and privacy, Riverside Research’s Trusted and Resilient Systems group is the place for you. We are searching for an individual to join our research group to help shape a more secure future. The team has ongoing research in security of machine learning, cryptography, hardware and hypervisor security solutions, as well as developing cutting edge solutions to the security of open architecture systems. The ideal person for this position is passionate about many diverse areas technology and can leverage their interests to develop and study creative solutions to some of the most difficult challenges. The current team resides in Riverside Research’s Beavercreek, OH, but we are willing to consider candidates that would prefer to work out of one of our Washington DC (Centerville or Crystal City) offices, our New York City office, or our Boston office.\n\nJob Responsibilities:\n•Work with a team of highly skilled researchers to develop interesting and novel solutions to security and privacy problems\n•Publish and present research in conferences and journals\n•Work with the team to identify future areas of research investment and develop research plans\n•Assist with writing technical proposals\n\nQualifications:\n•Ability to obtain and maintain TS/SCI security clearance\n•Bachelor's or Master's degree with significant experience in security privacy research\n•Prior experience developing software\n•Ability to work independently and with a team\n•Superior written and verbal communication skills\nDesired Qualifications:\n\n•Python\n•Web development (we use React)\n•Revision control (we use Git)\n•Machine learning\n•Cryptography\n•Prior experience with government funded research\n\nRiverside Research strives to be one of America’s premier providers of independent, trusted technical and scientific expertise. As we continue to add experienced, technically astute staff, we are looking for highly motivated, talented team members that can help our DoD and Intelligence Community (IC) customers continue delivery of world class programs. As a not-for-profit, technology-oriented Defense Company, we believe service to customers and support of our staff is our mission. Our goal is to serve as a destination company by providing an industry-leading, positive, and rewarding employee experience for all who join us. We aspire to be a valued partner to our customers and to earn their trust through our unwavering commitment to achieve timely, innovative, cost-effective and mission-focused solutions.\n\nAll positions at Riverside Research are subject to background investigations. Employment is contingent upon successful completion of a background investigation including criminal history and identity check.\n\nThis contractor and subcontractor shall abide by the requirements of 41 CFR 60-741.5(a). This regulation prohibits discrimination against qualified individuals on the basis of disability, and requires affirmative action by covered prime contractors and subcontractors to employ and advance in employment qualified individuals with disabilities.\n\nThis contractor and subcontractor shall abide by the requirements of 41 CFR 60-300.5(a). This regulation prohibits discrimination against qualified protected veterans, and requires affirmative action by covered contractors and subcontractors to employ and advance in employment qualified protected veterans.\n\nApply Now3.6Riverside Research Institute\n3.6Beavercreek, OHArlington, VA501 to 1000 employees1967Nonprofit OrganizationFederal AgenciesGovernment$50 to $100 million (USD)-1